GenCore version 6.2.1
Copyright (c) 1993 - 2008 Biocceleration Ltd. 



OM protein - protein search, using sw model 

Run on: June 24, 2008, 08:32:16 ; Search time 518 Seconds 

(without alignments) 
2963.070 Million cell updates/sec 

Title: US-10-552-515-1 
Perfect score: 4950 

Sequence: 1 MRMAATAWAGLQGPPLPTLC SELSSHWTPFTVPKASQLQQ 933 

Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 5032670 seqs, 1645091341 residues 

Total number of hits satisfying chosen parameters: 5032670 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100%
Listing first 45 summaries

Database : UniProt_12.1:*

1: uniprot_sprot:*
2: uniprot_trembl:*
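The header above names a Smith-Waterman ("sw") model with a gap-open penalty of 10.0 and a gap-extension penalty of 0.5. The following is a minimal sketch of a local-alignment score with affine gaps, not GenCore's implementation. It makes two assumptions: a simple match/mismatch score stands in for the BLOSUM62 table, and the first gapped residue costs `gap_open` while each further one costs `gap_ext` (the tool may count gaps differently). The function name `smith_waterman_affine` is hypothetical.

```python
def smith_waterman_affine(a, b, match=5.0, mismatch=-4.0,
                          gap_open=10.0, gap_ext=0.5):
    """Best local-alignment score of a and b with affine gap penalties.

    Assumption: match/mismatch scoring replaces BLOSUM62, and a gap of
    length n costs gap_open + (n - 1) * gap_ext.
    """
    neg_inf = float("-inf")
    n, m = len(a), len(b)
    # H: best score ending at (i, j); E/F: best score ending in a gap.
    H = [[0.0] * (m + 1) for _ in range(n + 1)]
    E = [[neg_inf] * (m + 1) for _ in range(n + 1)]
    F = [[neg_inf] * (m + 1) for _ in range(n + 1)]
    best = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            # Extend an existing gap or open a new one.
            E[i][j] = max(E[i][j - 1] - gap_ext, H[i][j - 1] - gap_open)
            F[i][j] = max(F[i - 1][j] - gap_ext, H[i - 1][j] - gap_open)
            s = match if a[i - 1] == b[j - 1] else mismatch
            # Local alignment: scores never drop below zero.
            H[i][j] = max(0.0, H[i - 1][j - 1] + s, E[i][j], F[i][j])
            best = max(best, H[i][j])
    return best


if __name__ == "__main__":
    # Eight matches bridged by one two-residue gap: 40 - 10 - 0.5
    print(smith_waterman_affine("AAAATTTT", "AAAACCTTTT"))
```

Under a real substitution matrix the self-alignment of the query gives the "Perfect score" reported in the header; with the toy scores here, a sequence aligned to itself simply scores `match` times its length.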

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
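The throughput figure in the run header can be cross-checked from the other header values, assuming one cell update per (query residue, database residue) pair. This small sketch uses the query length (933), the residue count searched (1645091341) and the search time (518 seconds) from the header; the function name is hypothetical.

```python
def cell_updates_per_second(query_len, db_residues, seconds):
    """Dynamic-programming cells filled per second of search time.

    Assumption: every query residue is compared against every database
    residue exactly once.
    """
    return query_len * db_residues / seconds


if __name__ == "__main__":
    rate = cell_updates_per_second(933, 1_645_091_341, 518)
    print(f"{rate / 1e6:.3f} Million cell updates/sec")
```

The result agrees with the 2963.070 Million cell updates/sec printed in the header.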

SUMMARIES 

                   %
Result         Query
 No.    Score  Match  Length  DB  ID            Description
   1     4950  100.0     933   1  TM16G_HUMAN   Q6iwh7 homo sapien
   2   3771.5   76.2     859   1  TM16G_MOUSE   Q14at5 mus musculu
   3     3764   76.0     860   1  TM16G_RAT     Q6ift6 rattus norv
   4   1539.5   31.1     920   2  A6QLE6_BOVIN  A6qle6 bos taurus
   5   1525.5   30.8     903   2  Q1AP36_STRPU  Q1ap36 strongyloce
   6   1525.5   30.8     955   1  TM16D_HUMAN   Q32m45 homo sapien
   7     1523   30.8     921   2  Q1AP35_STRPU  Q1ap35 strongyloce
   8   1512.5   30.6     981   2  A2AHE5_MOUSE  A2ahe5 mus musculu
   9     1504   30.4     981   1  TM16C_HUMAN   Q9byt9 homo sapien
  10   1500.5   30.3     900   2  A1A5Z4_DANRE  A1a5z4 danio rerio
  11     1497   30.2     921   2  Q1AP37_STRPU  Q1ap37 strongyloce
  12     1489   30.1     960   1  TM16A_MOUSE   Q8bhy3 mus musculu
  13     1482   29.9     986   1  TM16A_HUMAN   Q5xxa6 homo sapien
  14   1467.5   29.6     913   2  Q8CFW1_MOUSE  Q8cfw1 mus musculu
  15     1464   29.6     999   1  TM16B_HUMAN   Q9nq90 homo sapien
  16     1455   29.4     913   1  TM16E_HUMAN   Q75v66 homo sapien
  17     1449   29.3     896   2  Q6DDQ3_XENLA  Q6ddq3 xenopus lae
  18   1448.5   29.3    1014   2  Q175J3_AEDAE  Q175j3 aedes aegyp
  19     1445   29.2    1219   2  Q9VTS0_DROME  Q9vts0 drosophila
  20     1443   29.2    1235   2  Q2M0Y5_DROPS  Q2m0y5 drosophila
  21   1438.5   29.1    1117   2  Q0IEX5_AEDAE  Q0iex5 aedes aegyp
  22     1433   28.9     906   2  Q7QDY0_ANOGA  Q7qdy0 anopheles g
  23   1402.5   28.3     910   1  TM16F_HUMAN   Q4kmq2 homo sapien
  24     1402   28.3     904   1  TM16E_MOUSE   Q75ur0 mus musculu
  25     1402   28.3     904   2  Q3V657_MOUSE  Q3v657 mus musculu
  26     1389   28.1     909   2  A6NNM6_HUMAN  A6nnm6 homo sapien
  27     1384   28.0    1613   2  Q4S1F6_TETNG  Q4s1f6 tetraodon n
  28   1383.5   27.9     871   2  Q8JFT1_DANRE  Q8jft1 danio rerio
  29   1382.5   27.9     412   2  Q4SC97_TETNG  Q4sc97 tetraodon n
  30     1371   27.7     911   1  TM16F_MOUSE   Q6p9j9 mus musculu
  31   1369.5   27.7    1075   2  Q9VDV4_DROME  Q9vdv4 drosophila
  32   1367.5   27.6     712   2  Q8NCT7_HUMAN  Q8nct7 homo sapien
  33   1367.5   27.6     926   2  Q8IN71_DROME  Q8in71 drosophila
  34   1366.5   27.6     972   2  Q86P24_DROME  Q86p24 drosophila
  35     1354   27.4    1062   2  Q293M2_DROPS  Q293m2 drosophila
  36     1333   26.9     895   2  Q16L02_AEDAE  Q16l02 aedes aegyp
  37     1319   26.6    1035   2  Q4SSV5_TETNG  Q4ssv5 tetraodon n
  38     1306   26.4     863   2  Q16MH2_AEDAE  Q16mh2 aedes aegyp
  39     1290   26.1     823   2  Q7QEP9_ANOGA  Q7qep9 anopheles g
  40     1269   25.6    1017   2  Q29H97_DROPS  Q29h97 drosophila
  41   1254.5   25.3     854   2  Q2VPA8_MOUSE  Q2vpa8 mus musculu
  42     1249   25.2     984   2  Q8MT62_DROME  Q8mt62 drosophila
  43   1237.5   25.0     971   2  Q4REV7_TETNG  Q4rev7 tetraodon n
  44   1199.5   24.2    1043   2  Q9VYS8_DROME  Q9vys8 drosophila
  45   1199.5   24.2    1059   2  Q76NS2_DROME  Q76ns2 drosophila
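The "% Query Match" column in the summaries appears to be each hit's score expressed as a percentage of the perfect score (4950, the query aligned to itself). The sketch below recomputes it for a few rows of the listing; the function name is hypothetical, and the comparison allows for the one-decimal rounding used in the report.

```python
def query_match_percent(score, perfect_score=4950.0):
    """Score as a percentage of the query's self-alignment score."""
    return 100.0 * score / perfect_score


if __name__ == "__main__":
    # (score, percentage printed in the summaries) for results 1 to 4
    rows = [(4950, 100.0), (3771.5, 76.2), (3764, 76.0), (1539.5, 31.1)]
    for score, reported in rows:
        computed = query_match_percent(score)
        print(f"{score:>7}  computed {computed:6.2f}  reported {reported:5.1f}")
```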



ALIGNMENTS 



RESULT 1 
TM16G_HUMAN 

ID TM16G_HUMAN Reviewed; 933 AA. 

AC Q6IWH7; Q6IWH6; 

DT 29-MAY-2007, integrated into UniProtKB/Swiss-Prot.
DT 05-JUL-2004, sequence version 1. 
DT 24-JUL-2007, entry version 21. 

DE Transmembrane protein 16G (New gene expressed in prostate) (Prostate 
DE cancer-associated protein 5) (IPCA-5) (Dresden-transmembrane protein

DE of the prostate) (D-TMPP).

GN Name=TMEM16G; Synonyms=NGEP, PCANAP5; 

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; 

OC Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP NUCLEOTIDE SEQUENCE [MRNA] (ISOFORMS 1 AND 2), TISSUE SPECIFICITY, 

RP SUBCELLULAR LOCATION, AND TOPOLOGY. 

RX PubMed=14981236; DOI=10.1073/pnas.0308746101;

RA Bera T.K., Das S., Maeda H., Beers R., Wolfgang C.D., Kumar V.,

RA Hahn Y., Lee B., Pastan I.; 

RT "NGEP, a gene encoding a membrane protein detected only in prostate 

RT cancer and normal prostate."; 

RL Proc. Natl. Acad. Sci. U.S.A. 101:3059-3064(2004). 

RN [2] 

RP IDENTIFICATION. 

RX PubMed=10613842; DOI=10.1101/gr.9.12.1198;

RA Walker M.G., Volkmuth W., Sprinzak E., Hodgson D., Klingler T.; 

RT "Prediction of gene function by genome-scale expression analysis: 

RT prostate cancer-associated genes."; 

RL Genome Res. 9:1198-1203(1999). 

RN [3] 

RP IDENTIFICATION. 

RX PubMed=15375614; 

RA Katoh M., Katoh M.; 

RT "Characterization of human TMEM16G gene in silico."; 

RL Int. J. Mol. Med. 14:759-764(2004). 

RN [4] 

RP ALTERNATIVE SPLICING (ISOFORM 3), TISSUE SPECIFICITY, AND INDUCTION BY 

RP ANDROGEN.

RX PubMed=15761874; DOI=10.1002/pros.20250;

RA Kiessling A., Weigle B., Fuessel S., Ebner R., Meye A., Rieger M.A.,

RA Schmitz M., Temme A., Bachmann M., Wirth M.P., Rieber E.P.; 

RT "D-TMPP: a novel androgen-regulated gene preferentially expressed in 

RT prostate and prostate cancer that is the first characterized member of 

RT an eukaryotic gene family."; 

RL Prostate 64:387-400(2005). 

RN [5] 

RP FUNCTION, SUBCELLULAR LOCATION, AND TISSUE SPECIFICITY. 

RX PubMed=17308099; DOI=10.1158/0008-5472.CAN-06-2673;

RA Das S., Hahn Y., Nagata S., Willingham M.C., Bera T.K., Lee B.,

RA Pastan I.;

RT "NGEP, a prostate-specific plasma membrane protein that promotes the 

RT association of LNCaP cells."; 

RL Cancer Res. 67:1594-1601(2007). 

CC -!- FUNCTION: May play a role in cell-cell interactions. 

CC -!- SUBCELLULAR LOCATION: Isoform 1: Cell membrane; Multi-pass 

CC membrane protein. Isoform 2: Cytoplasm, cytosol. Note=Isoform 1 

CC concentrates at sites of cell-cell contact. 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=3; 

CC Name=1; Synonyms=NGEP-L;
CC IsoId=Q6IWH7-1; Sequence=Displayed;
CC Name=2; Synonyms=NGEP-S;
CC IsoId=Q6IWH7-2; Sequence=VSP_026004, VSP_026005, VSP_026006;
CC Name=3; Synonyms=D-TMPP;
CC IsoId=Q6IWH7-3; Sequence=VSP_026004, VSP_026007, VSP_026008;
CC -!- TISSUE SPECIFICITY: Specifically expressed in epithelial cells of
CC the prostate (at protein level).
CC -!- INDUCTION: Up-regulated by androgen.
CC -!- SIMILARITY: Belongs to the TMEM16 family.
CC -!- CAUTION: It is uncertain whether Met-1 or Met-55 is the initiator.
CC
CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution-NoDerivs License
CC
DR EMBL; AY617079; AAT40139.1; -; mRNA.
DR EMBL; AY617080; AAT40140.1; -; mRNA.
DR UniGene; Hs.163909; -.
DR Ensembl; ENSG00000146205; Homo sapiens.
DR HGNC; HGNC:31677; TMEM16G.
DR MIM; 605096; gene.
DR PharmGKB; PA32980; -.
DR ArrayExpress; Q6IWH7; -.
DR GO; GO:0005886; C:plasma membrane; IDA:MGI.
DR InterPro; IPR007632; DUF590.
DR PANTHER; PTHR12308; DUF590; 1.
DR Pfam; PF04547; DUF590; 1.
PE 1: Evidence at protein level;
KW Alternative splicing; Cytoplasm; Membrane; Polymorphism;
KW Transmembrane.
FT   CHAIN         1    933       Transmembrane protein 16G.
FT                                /FTId=PRO_0000289326.
FT   TOPO_DOM      1    355       Extracellular (Potential).
FT   TRANSMEM    356    376       Potential.
FT   TOPO_DOM    377    420       Cytoplasmic (Potential).
FT   TRANSMEM    421    441       Potential.
FT   TOPO_DOM    442    499       Extracellular (Potential).
FT   TRANSMEM    500    520       Potential.
FT   TOPO_DOM    521    550       Cytoplasmic (Potential).
FT   TRANSMEM    551    571       Potential.
FT   TOPO_DOM    572    588       Extracellular (Potential).
FT   TRANSMEM    589    609       Potential.
FT   TOPO_DOM    610    714       Cytoplasmic (Potential).
FT   TRANSMEM    715    735       Potential.
FT   TOPO_DOM    736    763       Extracellular (Potential).
FT   TRANSMEM    764    784       Potential.
FT   TOPO_DOM    785    843       Cytoplasmic (Potential).
FT   TRANSMEM    844    864       Potential.
FT   TOPO_DOM    865    933       Extracellular (Potential).
FT   VAR_SEQ     110    110       Missing (in isoform 2 and isoform 3).
FT                                /FTId=VSP_026004.
FT   VAR_SEQ     158    180       QDVQDGNTTVHYALLSASWAVLC -> VRGGCHGQGPRPCI
FT                                HSVTHDLAA (in isoform 2).
FT                                /FTId=VSP_026005.
FT   VAR_SEQ     181    933       Missing (in isoform 2).
FT                                /FTId=VSP_026006.
FT   VAR_SEQ     859    859       H -> VAEAPAGSPIHGMRPRPCALPNSSTW (in
FT                                isoform 3).
FT                                /FTId=VSP_026007.
FT   VAR_SEQ     860    933       Missing (in isoform 3).
FT                                /FTId=VSP_026008.
FT   VARIANT      67     67       V -> I (in dbSNP:rs2302054).
FT                                /FTId=VAR_032616.
SQ SEQUENCE 933 AA; 105531 MW; D6FD42578A41D7D3 CRC64;
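The TOPO_DOM and TRANSMEM features listed for TM16G_HUMAN should partition the 933-residue chain without gaps or overlaps. The following sketch checks that under the assumption that the coordinates are 1-based and inclusive, as in UniProt feature tables; the helper name `tiles_sequence` is hypothetical, and the coordinates are copied from the feature lines above.

```python
# (feature key, start, end) copied from the FT lines of TM16G_HUMAN
SEGMENTS = [
    ("TOPO_DOM", 1, 355), ("TRANSMEM", 356, 376),
    ("TOPO_DOM", 377, 420), ("TRANSMEM", 421, 441),
    ("TOPO_DOM", 442, 499), ("TRANSMEM", 500, 520),
    ("TOPO_DOM", 521, 550), ("TRANSMEM", 551, 571),
    ("TOPO_DOM", 572, 588), ("TRANSMEM", 589, 609),
    ("TOPO_DOM", 610, 714), ("TRANSMEM", 715, 735),
    ("TOPO_DOM", 736, 763), ("TRANSMEM", 764, 784),
    ("TOPO_DOM", 785, 843), ("TRANSMEM", 844, 864),
    ("TOPO_DOM", 865, 933),
]


def tiles_sequence(segments, length):
    """True if the segments cover residues 1..length exactly once, in order."""
    expected_start = 1
    for _key, start, end in segments:
        if start != expected_start or end < start:
            return False
        expected_start = end + 1
    return expected_start == length + 1


if __name__ == "__main__":
    helices = [s for s in SEGMENTS if s[0] == "TRANSMEM"]
    print("covers chain:", tiles_sequence(SEGMENTS, 933))
    print("transmembrane segments:", len(helices))
```

A check like this is also a quick way to catch digits dropped or transposed when a feature table has been through OCR.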

Query Match 100.0%; Score 4950; DB 1; Length 933; 

Best Local Similarity 100.0%; Pred. No. 0;

Matches 933; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy   1 MRMAATAWAGLQGPPLPTLCPAVRTGLYCRDQAHAERWAMTSETSSGSHCARSRMLRRRA 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MRMAATAWAGLQGPPLPTLCPAVRTGLYCRDQAHAERWAMTSETSSGSHCARSRMLRRRA 60

Qy  61 QEEDSTVLIDVSPPEAEKRGSYGSTAHASEPGGQQAAACRAGSPAKPRIADFVLVWEEDL 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 QEEDSTVLIDVSPPEAEKRGSYGSTAHASEPGGQQAAACRAGSPAKPRIADFVLVWEEDL 120

Qy 121 KLDRQQDSAARDRTDMHRTWRETFLDNLRAAGLCVDQQDVQDGNTTVHYALLSASWAVLC 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 KLDRQQDSAARDRTDMHRTWRETFLDNLRAAGLCVDQQDVQDGNTTVHYALLSASWAVLC 180

Qy 181 YYAEDLRLKLPLQELPNQASNWSAGLLAWLGIPNVLLEVVPDVPPEYYSCRFRVNKLPRF 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 YYAEDLRLKLPLQELPNQASNWSAGLLAWLGIPNVLLEVVPDVPPEYYSCRFRVNKLPRF 240

Qy 241 LGSDNQDTFFTSTKRHQILFEILAKTPYGHEKKNLLGIHQLLAEGVLSAAFPLHDGPFKT 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 LGSDNQDTFFTSTKRHQILFEILAKTPYGHEKKNLLGIHQLLAEGVLSAAFPLHDGPFKT 300

Qy 301 PPEGPQAPRLNQRQVLFQHWARWGKWNKYQPLDHVRRYFGEKVALYFAWLGFYTGWLLPA 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 PPEGPQAPRLNQRQVLFQHWARWGKWNKYQPLDHVRRYFGEKVALYFAWLGFYTGWLLPA 360

Qy 361 AVVGTLVFLVGCFLVFSDIPTQELCGSKDSFEMCPLCLDCPFWLLSSACALAQAGRLFDH 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 AVVGTLVFLVGCFLVFSDIPTQELCGSKDSFEMCPLCLDCPFWLLSSACALAQAGRLFDH 420

Qy 421 GGTVFFSLFMALWAVLLLEYWKRKSATLAYRWDCSDYEDTEERPRPQFAASAPMTAPNPI 480
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 421 GGTVFFSLFMALWAVLLLEYWKRKSATLAYRWDCSDYEDTEERPRPQFAASAPMTAPNPI 480

Qy 481 TGEDEPYFPERSRARRMLAGSVVIVVMVAVVVMCLVSIILYRAIMAIVVSRSGNTLLAAW 540
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 481 TGEDEPYFPERSRARRMLAGSVVIVVMVAVVVMCLVSIILYRAIMAIVVSRSGNTLLAAW 540

Qy 541 ASRIASLTGSVVNLVFILILSKIYVSLAHVLTRWEMHRTQTKFEDAFTLKVFIFQFVNFY 600
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 541 ASRIASLTGSVVNLVFILILSKIYVSLAHVLTRWEMHRTQTKFEDAFTLKVFIFQFVNFY 600

Qy 601 SSPVYIAFFKGRFVGYPGNYHTLFGVRNEECAAGGCLIELAQELLVIMVGKQVINNMQEV 660
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 601 SSPVYIAFFKGRFVGYPGNYHTLFGVRNEECAAGGCLIELAQELLVIMVGKQVINNMQEV 660

Qy 661 LIPKLKGWWQKFRLRSKKRKAGASAGASQGPWEDDYELVPCEGLFDEYLEMVLQFGFVTI 720
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 661 LIPKLKGWWQKFRLRSKKRKAGASAGASQGPWEDDYELVPCEGLFDEYLEMVLQFGFVTI 720

Qy 721 FVAACPLAPLFALLNNWVEIRLDARKFVCEYRRPVAERAQDIGIWFHILAGLTHLAVISN 780
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 721 FVAACPLAPLFALLNNWVEIRLDARKFVCEYRRPVAERAQDIGIWFHILAGLTHLAVISN 780

Qy 781 AFLLAFSSDFLPRAYYRWTRAHDLRGFLNFTLARAPSSFAAAHNRTCRYRAFRDDDGHYS 840
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 781 AFLLAFSSDFLPRAYYRWTRAHDLRGFLNFTLARAPSSFAAAHNRTCRYRAFRDDDGHYS 840

Qy 841 QTYWNLLAIRLAFVIVFEHVVFSVGRLLDLLVPDIPESVEIKVKREYYLAKQALAENEVL 900
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 841 QTYWNLLAIRLAFVIVFEHVVFSVGRLLDLLVPDIPESVEIKVKREYYLAKQALAENEVL 900

Qy 901 FGTNGTKDEQPKGSELSSHWTPFTVPKASQLQQ 933
       |||||||||||||||||||||||||||||||||
Db 901 FGTNGTKDEQPKGSELSSHWTPFTVPKASQLQQ 933
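Each result in this listing reports Matches, Conservative, Mismatches and Indels counts alongside a "Best Local Similarity" percentage. The figures printed here are consistent with the percentage being the match count divided by the total number of alignment columns, so the sketch below assumes that formula; it is inferred from the numbers in this report, not taken from GenCore documentation, and the function name is hypothetical.

```python
def best_local_similarity(matches, conservative, mismatches, indels):
    """Identical columns as a percentage of all alignment columns.

    Assumption: the four counts together make up every column of the
    local alignment.
    """
    columns = matches + conservative + mismatches + indels
    return 100.0 * matches / columns


if __name__ == "__main__":
    # Counts as printed for RESULT 1, RESULT 3 and RESULT 4
    for name, counts in [
        ("TM16G_HUMAN", (933, 0, 0, 0)),
        ("TM16G_RAT", (714, 47, 94, 8)),
        ("A6QLE6_BOVIN", (361, 166, 311, 117)),
    ]:
        print(f"{name:<13} {best_local_similarity(*counts):.1f}%")
```

These reproduce the 100.0%, 82.7% and 37.8% values shown with the corresponding results.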



RESULT 2 

TM16G_MOUSE

ID TM16G_MOUSE Reviewed; 859 AA.

AC Q14AT5; Q6IFT5; 

DT 29-MAY-2007, integrated into UniProtKB/Swiss-Prot.

DT 29-MAY-2007, sequence version 2. 

DT 24-JUL-2007, entry version 9. 

DE Transmembrane protein 16G (New gene expressed in prostate homolog).

GN Name=Tmem16g; Synonyms=Ngep;

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi;

OC Muroidea; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].

RC STRAIN=C57BL/6J; 

RG The mouse genome sequencing consortium; 

RL Submitted (JAN-2002) to the EMBL/GenBank/DDBJ databases. 

RN [2] 

RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2). 

RX PubMed=15489334; DOI=10.1101/gr.2596504;

RG The MGC Project Team; 

RT "The status, quality, and expansion of the NIH full-length cDNA 

RT project: the Mammalian Gene Collection (MGC)."; 

RL Genome Res. 14:2121-2127(2004). 

RN [3] 

RP IDENTIFICATION. 

RX PubMed=14981236; DOI=10.1073/pnas.0308746101;

RA Bera T.K., Das S., Maeda H., Beers R., Wolfgang C.D., Kumar V.,

RA Hahn Y., Lee B., Pastan I.;

RT "NGEP, a gene encoding a membrane protein detected only in prostate 

RT cancer and normal prostate."; 

RL Proc. Natl. Acad. Sci. U.S.A. 101:3059-3064(2004). 
CC -!- FUNCTION: May play a role in cell-cell interactions (By
CC similarity).

CC -!- SUBCELLULAR LOCATION: Cell membrane; Multi-pass membrane protein
CC (By similarity). Note=Concentrates at sites of cell-cell contact

CC (By similarity).

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=2; 

CC Name=1;

CC IsoId=Q14AT5-1; Sequence=Displayed;

CC Name=2; 

CC IsoId=Q14AT5-2; Sequence=VSP_026009, VSP_026010; 

CC -!- SIMILARITY: Belongs to the TMEM16 family. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribution-NoDerivs License 

CC 

DR EMBL; AC108412; -; NOT_ANNOTATED_CDS; Genomic_DNA. 

DR EMBL; AC124669; -; NOT_ANNOTATED_CDS; Genomic_DNA.

DR EMBL; BC116706; AAI16707.1; -; mRNA. 

DR EMBL; BK004075; DAA04566.1; -; mRNA. 

DR UniGene; Mm.297411; -.

DR Ensembl; ENSMUSG00000034107; Mus musculus.

DR MGI; MGI:3052714; Tmem16g.

DR InterPro; IPR007632; DUF590. 

DR PANTHER; PTHR12308; DUF590; 1. 

DR Pfam; PF04547; DUF590; 1. 

PE 2: Evidence at transcript level; 

KW Alternative splicing; Membrane; Transmembrane. 
Matches 716; Conservative 43; Mismatches 95; Indels 9; Gaps 4;

y 55 MLRRRAQEEDSTVLIDVSPPEAEKRGSYGSTAHASEPGGQQAAACRAGSPAKPRIADFVL 114 

III : I : I I I I MM:: || | II I M I II I I II I I II II I I II II 

b 1 MLRGQAREEDSVVLIDMASPEAGNGCSYGSTAQASEAGKQQVAPSRVGSSAKPPI-DFVL 59 

115 VWEEDLKLDRQQDSAARDRTDMHRTWRETFLDNLRAAGLCVDQQDVQDGNTTVHYALLSA 174 
M M M II:: : I : I I I I I I I I I : I I I I I : I I I I I I I I I I I I 

60 VWEEDL RNQENPTKDKTDTHEVWRETFLENLCLAGLKIDQHDVQDEAAAVHYILLRA 116 

175 SWAVLCYYAEDLRLKLPLQELPNQASNWSAGLLAWLGIPNVLLEVVPDVPPEYYSCRFRV 2 34 

I I I I I I I I I I I I I I I II I II II II II II I II II II II : II I III I I I I I I I : I : 
117 PWAVLCYYAEDLRLKLPLQELPNQASNWSATLLEWLGIPNILLEHVPDTPPEYYSCQFKA 176 

235 NKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPYGHEKKNLLGIHQLLAEGVLSAAFPLH 2 94 

: I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 
177 SKLQWFLGSDNQDTFFTSTKRHQILFEILAKTPYGHEKKGLFGIDQLLAEGVFSAAFPLH 236 

295 DGPFKTPPEGPQAPRLNQRQVLFQHWARWGKWNKYQPLDHVRRYFGEKVAL YFAWLGF YT 354 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
23 7 DGPFSAVPESSQVLGLIQRQVLFQHWARWGKWNKYQPLDHVRRYFGEKVALYFAWLGFYT 2 96 

355 GWLLPAAVVGTLVFLVGCFLVFSDIPTQELCGSKDSFEMCPLCLDCPFWLLSSACALAQA 414 

I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I 111:11111 II I I I I I I I I I I I I 
29 7 GWLLPAAVVGTWFLVGCFLVFSDIPTQELCHSSDSFDMCPLCSDCSFWLLSSACTLAQA 356 

415 GRLFDHGGTVFFSLFMALWAVLLLEYWKRKS ATLAYRWDCSDYEDTEERPRPQFAAS APM 4 74 

II II I M I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I : II I 
35 7 GRLFDHGGTVFFSLFMALWAVLLLEYWKRKNATLAYRWDCSDYEDIEERPRPQFAATAPM 416 

475 TAPNPITGEDEPYFPERSRARRMLAGSWIVVMVAVWMCLVSIILYRAIMAIWSRSGN 534 
II I I I I I I I I I I I I I :: I I I I I I I I I I ::: I I I I I : I I I I I : I I I I I : I I I : I I I I I 
Db 417 TALNPITGEDEPYFPEKNRVRRMLAGSWLLMMVAVVIMCLVSVILYRAVMAIIVSRSDN 476 

Qy 535 TLLAAWASRIASLTGSVVNLVFILILSKIYVSLAHVLTRWEMHRTQTKFEDAFTLKVFIF 594 

I : I I I I I II I II II II II II II I I I I : I I II II II II II II II : I I I I I I I I I I I I 
Db 477 AFLSAWASRIASLTGSWNLVFILILSKVYVLLAQVLTRWEMHRTQTEFEDAFTLKVFIF 536 

Qy 595 QFVNFYSSPVYIAFFKGRFVGYPGNYHTLFGVRNEECAAGGCLIELAQELLVIMVGKQVI 654 

I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I : I 
Db 53 7 QFVNFYASPVYIAFFKGRFVGYPGNYHTLFGIRNEECPAGGCLSELAQELLVIMVGKQII 596 

Qy 655 NNMQEVLIPKLKGWWQKFRLRSKKRKAGASAGASQGPWEDDYELVPCEGLFDEYLEMVLQ 714 

I I : I I I I : I I I I I I I I I I : : I II I III II II : II I I I I I I I I I I I I 

Db 59 7 NNVQEVLVPKLKGCWQKF SRGKKAG — TGTHPAPWEADYELLPCEGLFHEYLEMVLQ 651 

Qy 715 FGFVTIFVAACPLAPLFALLNNWVEIRLDARKFVCEYRRPVAERAQDIGIWFHILAGLTH 774 

II M M II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II II 

Db 652 FGFVTIFVAACPLAPLFALLNNWVEIRLDARKFVCEYRRPVAERAQDIGIWFHILTGLTH 711 

Qy 775 LAVISNAFLLAFSSDFLPRAYYRWTRAHDLRGFLNFTLARAPSSFAAAHNRTCRYRAFRD 834 

I I I I I I I I I I I I I I I I I I I II II I II I I I I I I I I I I I : I : I I I I I I I I I I I I I 

Db 712 LAVISNAFLLAFSSDFLPRVYYSWTHAPDLHGFLNFTLARAPPTFTSAHNRTCRYRAFRD 7 71 

Qy 835 DDGHYSQTYWNLLAIRLAFVIVFEHVVFSVGRLLDLLVPDIPESVEIKVKREYYLAKQAL 894 

I I I I I I III M II I I I I I I I I I I I I I I : I I : I I I I I I I I I I I I I II II II II II I I I I 
Db 772 DDGHYSPTYWTLLAIRLAFVIVFEHVVFSIGRVLDLLVPDIPESVEIKVKREYYLAKQAL 831

Qy 895 AENEVLFGTNGTKDEQPKGSELS 917 

I M I I I I I I : I I III 
Db 832 AENEALLGATGVKDDQPPSSEPS 854 



RESULT 3 
TM16G_RAT 

ID TM16G_RAT Reviewed; 860 AA. 

AC Q6IFT6; 

DT 29-MAY-2007, integrated into UniProtKB/Swiss-Prot.

DT 05-JUL-2004, sequence version 1. 

DT 24-JUL-2007, entry version 22. 

DE Transmembrane protein 16G (New gene expressed in prostate homolog).

GN Name=Tmem16g; Synonyms=Ngep;

OS Rattus norvegicus (Rat).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; 

OC Muroidea; Muridae; Murinae; Rattus. 

OX NCBI_TaxID=10116; 

RN [1] 

RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].

RC STRAIN=Brown Norway;

RX PubMed=15057822; DOI=10.1038/nature02426;

RA Gibbs R.A., Weinstock G.M., Metzker M.L., Muzny D.M., Sodergren E.J.,

RA Scherer S., Scott G., Steffen D., Worley K.C., Burch P.E., Okwuonu G.,

RA Hines S., Lewis L., Deramo C., Delgado O., Dugan-Rocha S., Miner G.,

RA Morgan M., Hawes A., Gill R., Holt R.A., Adams M.D., Amanatides P.G.,

RA Baden-Tillson H., Barnstead M., Chin S., Evans C.A., Ferriera S.,

RA Fosler C., Glodek A., Gu Z., Jennings D., Kraft C.L., Nguyen T.,

RA Pfannkoch C.M., Sitter C., Sutton G.G., Venter J.C., Woodage T.,

RA Smith D., Lee H.-M., Gustafson E., Cahill P., Kana A.,

RA Doucette-Stamm L., Weinstock K., Fechtel K., Weiss R.B., Dunn D.M.,

RA Green E.D., Blakesley R.W., Bouffard G.G., De Jong P.J., Osoegawa K.,

RA Zhu B., Marra M., Schein J., Bosdet I., Fjell C., Jones S.,

RA Krzywinski M., Mathewson C., Siddiqui A., Wye N., McPherson J.,

RA Zhao S., Fraser C.M., Shetty J., Shatsman S., Geer K., Chen Y.,

RA Abramzon S., Nierman W.C., Havlak P.H., Chen R., Durbin K.J., Egan A.,

RA Ren Y., Song X.-Z., Li B., Liu Y., Qin X., Cawley S., Cooney A.J.,

RA D'Souza L.M., Martin K., Wu J.Q., Gonzalez-Garay M.L., Jackson A.R.,

RA Kalafus K.J., McLeod M.P., Milosavljevic A., Virk D., Volkov A.,

RA Wheeler D.A., Zhang Z., Bailey J.A., Eichler E.E., Tuzun E.,

RA Birney E., Mongin E., Ureta-Vidal A., Woodwark C., Zdobnov E.,

RA Bork P., Suyama M., Torrents D., Alexandersson M., Trask B.J.,

RA Young J.M., Huang H., Wang H., Xing H., Daniels S., Gietzen D.,

RA Schmidt J., Stevens K., Vitt U., Wingrove J., Camara F., Mar Alba M.,

RA Abril J.F., Guigo R., Smit A., Dubchak I., Rubin E.M., Couronne O.,

RA Poliakov A., Huebner N., Ganten D., Goesele C., Hummel O.,

RA Kreitler T., Lee Y.-A., Monti J., Schulz H., Zimdahl H.,

RA Himmelbauer H., Lehrach H., Jacob H.J., Bromberg S.,

RA Gullings-Handley J., Jensen-Seaman M.I., Kwitek A.E., Lazar J.,

RA Pasko D., Tonellato P.J., Twigger S., Ponting C.P., Duarte J.M.,

RA Rice S., Goodstadt L., Beatson S.A., Emes R.D., Winter E.E.,

RA Webber C., Brandt P., Nyakatura G., Adetobi M., Chiaromonte F.,

RA Miller W., Nekrutenko A., Riemer C., Schwartz S., Taylor J., Yang S.,

RA Zhang Y., Lindpaintner K., Andrews T.D., Caccamo M., Clamp M.,

RA Clarke L., Curwen V., Durbin R.M., Eyras E., Searle S.M., Cooper G.M.,

RA Batzoglou S., Brudno M., Sidow A., Stone E.A., Payseur B.A.,

RA Bourque G., Lopez-Otin C., Puente X.S., Chakrabarti K., Chatterji S.,

RA Dewey C., Pachter L., Bray N., Yap V.B., Caspi A., Tesler G.,

RA Pevzner P.A., Haussler D., Roskin K.M., Baertsch R., Clawson H.,

RA Furey T.S., Hinrichs A.S., Karolchik D., Kent W.J., Rosenbloom K.R.,

RA Trumbower H., Weirauch M., Cooper D.N., Stenson P.D., Ma B., Brent M.,

RA Arumugam M., Shteynberg D., Copley R.R., Taylor M.S., Riethman H.,

RA Mudunuri U., Peterson J., Guyer M., Felsenfeld A., Old S., Mockrin S.,

RA Collins F.S.;

RT "Genome sequence of the Brown Norway rat yields insights into 

RT mammalian evolution."; 

RL Nature 428:493-521(2004). 

RN [2] 

RP IDENTIFICATION. 

RX PubMed=14981236; DOI=10.1073/pnas.0308746101;

RA Bera T.K., Das S., Maeda H., Beers R., Wolfgang C.D., Kumar V.,

RA Hahn Y., Lee B., Pastan I.; 

RT "NGEP, a gene encoding a membrane protein detected only in prostate 

RT cancer and normal prostate."; 

RL Proc. Natl. Acad. Sci. U.S.A. 101:3059-3064(2004). 

CC -!- FUNCTION: May play a role in cell-cell interactions (By
CC similarity).

CC -!- SUBCELLULAR LOCATION: Cell membrane; Multi-pass membrane protein
CC (By similarity). Note=Concentrates at sites of cell-cell contact

CC (By similarity).

CC -!- SIMILARITY: Belongs to the TMEM16 family. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribution-NoDerivs License 

CC 

DR EMBL; AABR03068351; -; NOT_ANNOTATED_CDS; Genomic_DNA.

DR EMBL; AABR03069029; -; NOT_ANNOTATED_CDS; Genomic_DNA.

DR EMBL; AABR03070767; -; NOT_ANNOTATED_CDS; Genomic_DNA.

DR EMBL; BK004074; DAA04565.1; -; mRNA.

DR UniGene; Rn.163084; -.

DR Ensembl; ENSRNOG00000023427; Rattus norvegicus. 

DR RGD; 1302987; Ngep. 

DR InterPro; IPR007632; DUF590. 

DR PANTHER; PTHR12308; DUF590; 1. 

DR Pfam; PF04547; DUF590; 1. 
Query Match 76.0%; Score 3764; DB 1; Length 860;

Best Local Similarity 82.7%; Pred. No. 4.3e-300;

Matches 714; Conservative 47; Mismatches 94; Indels 8; Gaps 4; 

y 55 MLRRRAQEEDSTVLIDVSPPEAEKRGSYGSTAHASEPGGQQAAACRAGSPAKPRIADFVL 114 

I I I : : I I I I I MM:: Ml M M M Ml I II I I II I I I II II 

b 1 MLRKQAGEEDSWLIDMTSPEAGNGCSYGSTAQASEAGKQQVAPSRVGSSANPPI-DFVL 59 

y 115 VWEEDLKLDRQQDSAARDRTDMHRTWRETFLDNLRAAGLCVDQQDVQDGNTTVHYALLSA 174 

II II II I : : : : I : I I I I I I I I I : I I I I I I : I I : I I I I I I I I I I I 

b 60 VWEEDL RSRENPTQDKTDTHEIWRETFLENLRVAGLKIDQRDVQDEAAAVHYILLSA 116 

•y 175 SWAVLCYYAEDLRLKLPLQELPNQASNWSAGLLAWLGIPNVLLEVVPDVPPEYYSCRFRV 234 

M M M M M M M M M I M I M M I M II II II M : M I III II II II I : I : 
'b 117 PWAVLCYYAEDLRLKLPLQELPNQASNWSATLLEWLGIPNILLENVPDTPPEYYSCQFKA 176 

y 235 NKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPYGHEKKNLLGIHQLLAEGVLSAAFPLH 294 

: I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I II I I I I I I I I I I I I I I 
'h 111 SKLQWFLGSDNQDTFFTSTKRHQILFEILAKTPYGHQKKGLFGIDQLLAEGVFSAAFPLH 236 

'y 295 DGPFKTPPEGPQAPRLNQRQVLFQHWARWGKWNKYQPLDHVRRYFGEKVALYFAWLGFYT 3 54 

II II II I I II II II : M M M M M II II II II II II II II II II II I I I 
'b 23 7 DGPFSVVPESSQVLGLTQRQVLFKHWARWGKWRKYQPLDHVRRYFGEKVALYFAWLGFYT 2 96 

!y 355 GWLLPAAVVGTLVFLVGCFLVFSDIPTQELCGSKDSFEMCPLCLDCPFWLLSSACALAQA 414 

M M I M II II : II I I I I I I I I I : I I I I I I I I : I : I I I I I II I I I I I II I I II I 
b 29 7 GWLLPAAVVGTWFLAGCFLVFSDVPTQELCHSSDTFDMCPLCSDCSFWLLSSACTLAQA 356 

y 415 GRLFDHGGTVFFSLFMALWAVLLLEYWKRKSATLAYRWDCSDYEDTEERPRPQFAASAPM 474 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I : I I I 

'b 35 7 GRLFDHGGTVFFSLFMALWAVLLLEYWKRKNATLAYRWDCSDYEDIEERPRPQFAATAPM 416 

!y 475 TAPNPITGEDEPYFPERSRARRMLAGSWIVVMVAVWMCLVSIILYRAIMAIVVSRSGN 534 

II M II II II I M M : : I I I I I I I I I I ::: I I I I I : I I I I I I I I I I I : I I I : I I : I I 

'b 417 TALNPITGEDEPYFPEKNRVRRMLAGSWLLMMVAVVIMCLVSIILYRAVMAIIVSKSNN 476 

!y 535 TLLAAWASRIASLTGSWNLVFILILSKIYVSLAHVLTRWEMHRTQTKFEDAFTLKVFIF 594 

I : II I II I II II I II II I II I II II I : I I II I I I I I I I I I I I I I I I I I I I I I I I I 
'b 477 AFLSAWASRIASLTGSWNLVFILILSKVYVILAQVLTRWEMHRTQTAFEDAFTLKVFIF 536 

!y 595 QFVNFYSSPVYIAFFKGRFVGYPGNYHTLFGVRNEECAAGGCLIELAQELLVIMVGKQVI 654 

II II II : I M M M II II II II I I I I I I I I I I I I I I I I I I I I II I II II II II II I : I 
'b 53 7 QFVNFYASPVYIAFFKGRFVGYPGNYHTLFGVRNEECPAGGCLSELAQELLVIMVGKQII 596 
Qy 655 NNMQEVLIPKLKGWWQKFRLRSKKRKAGASAGASQGPWEDDYELVPCEGLFDEYLEMVLQ 714

11:1111:11111 III I I : : : I I I I I : III I I I I : I I I I I I I I I I I I I I 
Db 59 7 NNVQEVLVPKLKGCWQK — LCSRRKKAG — MGANPAPWEADYELLPCEGLFHEYLEMVLQ 652 

Qy 715 FGFVTIFVAACPLAPLFALLNNWVEIRLDARKFVCEYRRPVAERAQDIGIWFHILAGLTH 774 

I I I I I I I I I I I I I I I I I I I I M I I M I I I I I I I I II II II II II I I I I I I I I I I I I I I I I 
Db 653 FGFVTIFVAACPLAPLFALLNNWVEIRLDARKFVCEYRRPVAERAQDIGIWFHILAGLTH 712 

Qy 775 LAVISNAFLLAFSSDFLPRAYYRWTRAHDLRGFLNFTLARAPSSFAAAHNRTCRYRAFRD 834 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I : I : I I I I I I I I I I I I I 
Db 713 LAVISNAFLLAFSSDFLPRVYYSWTRAPDLRGFLNFTLARAPPTFTSAHNRTCRYRAFRD 772 

Qy 835 DDGHYSQTYWNLLAIRLAFVIVFEHVVFSVGRLLDLLVPDIPESVEIKVKREYYLAKQAL 894 

I I I I I I III I I I I I I I I I I I I I I I I M II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 7 73 DDGHYSPTYWTLLAIRLAFVIVFEHVVFSTGRFLDLLVPDIPESVEIKVKREYYLAKQAL 832 

Qy 895 AENEVLFGTNGTKDEQPKGSELS 917 

I : M I I I I I I I III 
Db 833 ADNEALLGATGVKGEQPPSSEPS 855 



RESULT 4 
A6QLE6_BOVIN 

ID A6QLE6_BOVIN Unreviewed; 920 AA. 

AC A6QLE6; 

DT 21-AUG-2007, integrated into UniProtKB/TrEMBL.

DT 21-AUG-2007, sequence version 1. 

DT 21-AUG-2007, entry version 1. 

DE Putative uncharacterized protein. 

OS Bos taurus (Bovine).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Laurasiatheria; Cetartiodactyla; Ruminantia; 

OC Pecora; Bovidae; Bovinae; Bos. 

OX NCBI_TaxID=9913; 

RN [1] 

RP NUCLEOTIDE SEQUENCE. 

RC STRAIN=L1 Hereford; TISSUE=Basal ganglia; 

RA Moore S., Alexander L., Brownstein M., Guan L., Lobo S., Meng Y., 

RA Tanaguchi M., Wang Z., Yu J., Prange C., Schreiber K., Shenmen C.,

RA Wagner L., Bala M., Barbazuk S., Barber S., Babakaiff R., Beland J., 

RA Chun E., Del Rio L., Gibson S., Hanson R., Kirkpatrick R., Liu J., 

RA Matsuo C., Mayo M., Santos R.R., Stott J., Tsai M., Wong D.,

RA Siddiqui A., Holt R., Jones S.J., Marra M.A.; 

RL Submitted (JUN-2007) to the EMBL/GenBank/DDBJ databases. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribution-NoDerivs License 

CC 

DR EMBL; BC147937; AAI47938.1; -; mRNA. 

PE 2: Evidence at transcript level; 

SQ SEQUENCE 920 AA; 107708 MW; 3AFFC6310F1862CD CRC64; 

Query Match 31.1%; Score 1539.5; DB 2; Length 920; 

Best Local Similarity 37.8%; Pred. No. 6.7e-117; 

Matches 361; Conservative 166; Mismatches 311; Indels 117; Gaps 29; 
44 TSSGSHCARSRMLRRRAQEEDSTVLID VSPPEAE KRGSYGST AHASEP 91

: I I I I : I : | : | : | | I : I : I : I I I I 

4 SSSGITNGRTRVFHPVA — KDVNILFDELEAVNSPCKDDDSLLHPGNLTSTSDDASRLEA 61 

92 GGQQAAACRAGSPAKPRI ADFVLVWEEDLKLDRQQDSAARDRTDMHRTW 140 

II: II:: I : : I I : I : I : 
62 GGETV PEKNKLNGLYFRDGKCRIDYILVYR KSNPQMEK 99 

141 RETFLDNLRAAGLCVDQQDVQDGNTTVHYALLSASWAVLCYYAEDLRLKLPLQE LP 196 

III I : I I I I : : : : I : : : I I I I I I I I : : : : I : I I 

100 REVFERNIRAEGLQMEKESSLI-NSDIIFVKLHAPWEVLGRYAEQMNVRMPFRRKIYYLP 158 

19 7 NQASNWS AGLLAWLGIPNVLL — EVVPDVPP-EYYSCRFRVNKLPRFLGSDNQD 2 47 

: I : II : I I : II : : I : I : : I : I : I 

159 RRYKFMSRIDKQISRFRRWLPKKPMRLDKETLPDLEENDCYTAPFSQQRIHHFI-IHNKD 217 

248 TFFTSTKRHQILFEILAKTPYGHEKKNLLGIHQLLAEGVLSAAFPLHDGPFKTPPEGPQA 307 

III: I : I : I I : I I I I : I : : : I I I | | | | | | : | : : : 
218 TFFNNATRSRIVHHILQRIKY-EEGKNKIGLNRLLTNGSYEAAFPLHEGSYRSKNSIRTH 276

308 PRLNQRQVLFQHWARWGKWNKYQPLDHVRRYFGEKVALYFAWLGFYTGWLLPAAVVGTLV 367

: I I : I : : II II I I I I I I I M I I I I I I : I I I I I I I : I I I I III : I I 
277 GAVNHRHLLYECWASWGVWYKYQPLDLVRRYFGEKIGLYFAWLGWYTGMLFPAAFIGLFV 336 

368 FLVGCFLVFSDIPTQELCGSKDSFEMCPLC-LDCPFWLLSSACALAQAGRLFDHGGTVFF 426

Ml : : : I : I : I I I I : I III I I : I I : I I I : I II II

337 FLYGVITLDHCQVSKEVCQATDII-MCPVCDKYCPFMRLSDSCVYAKVTHLFDNGATVFF 395

427 SLFMALWAVLLLEYWKRKSATLAYRWDCSDYEDTEERPRPQFAAS-APMTAPNPITGEDE 485

: : I I I : I I : I I : I I I : I : I I I I I : I : I I I I I I I : I I I : I : I 

396 AVFMAVWATVFLEFWKRRRAVIAYDWDLIDWEEEEEEIRPQFEAKYSKKERMNPISGKPE 455 

48 6 PYFPERSRARRMLAGSVVIVVMVAVVVMCLVSIILYRAIMAIVVSRSGNTLLA-AWA 541 

II : I : : : I I : M : : I : : I I : : I I II 

456 PYQAFADKCSRLIVSASGIFFMICWIAAVFGIVIYRWTV STFAAFKWALIR 508 

542 — SRIASLTGSW — NLVFILILSKIYVSLAHVLTRWEMHRTQTKFEDAFTLKVFIFQFV 59 7 

I : : I : I I : I I I : : I : : I : I : I I I I I : : : : I : : I I I I : I : I I I I 
509 NNSQVAT-TGTAVCINFCIIMLLNVLYEKVALLLTNLEQPRTESEWENSFTLKMFLFQFV 567 

598 NFYSSPVYIAFFKGRFVGYPGNYHTLFG-VRNEECAAGGCLIELAQELLVIMVGKQVINN 656

I II 111111111:111 I I I I I I I I I : I : : : I I I I I II

568 NLNSSTFYIAFFLGRFTGHPGAYLRLINRWRLEECHPSGCLIDLCMQMGIIMVLKQTWNN 627

65 7 MQE VL IPKLKGWWQKFRLRSKKRKAGASAGASQGPWEDDYELVPCE — GLFDEYLEMVLQ 714 

I : I : : I I : I : : : I I I I I I I I I I I I I I I I I : I I 

62 8 FMELGYPLIQNWWTR RKVRQEHGPERKISFPQWEKDYNLQPMNAYGLFDEYLEMILQ 684 

715 FGFVTIFVAACPLAPLFALLNNWVEIRLDARKFVCEYRRPVAERAQDIGIWFHILAGLTH 774

III I I I I I I I I I I I I I I I I : I I I I I I III : : I I I : I I I : I I I I I : II I :

685 FGFTTIFVAAFPLAPLLALLNNIIEIRLDAYKFVTQWRRPLASRAKDIGIWYGILEGIGI 744

775 LAVISNAFLLAFSSDFLPRAYYRW TRAHDLRGFLNFTLA 813 

I : I I : I I I : : I : I I I : I I I : : | : : | : | : 

745 LSVITNAFVIAITSDFIPRLVYAYKYGPCAGQGEAGQKCMVGYVNASLSVFRISDFENRS 804 
Qy 814 RAPSSFAAAHNRTCRYRAFRDDDGH YSQTYWNLLAIRLAFVIVFEHVVFSVG 865

II: : I I I I : I I I : : I : : I I I I I I : I I I I I : I I : 

Db 805 EPESDGSEFSGTPLKYCRYRDYRDPPHSLVPYGYTLQFWHVLAARLAFIIVFEHLVFCIK 864 

Qy 866 RLLDLLVPDIPESVEIKVKREYYLAKQALAENEVLFGTNGTKDEQPKGSELSSHW 920 

I : I : II : I : : : : : || || : : : j j : | : : | : | 

Db 865 HLISYLIPDLPKDLRDRMRREKYLIQEMMYEAELERLQKERKERKKNGKAHHNEW 919 



RESULT 5 
Q1AP36_STRPU 

ID Q1AP36_STRPU Unreviewed; 903 AA. 

AC Q1AP36; 

DT 11-JUL-2006, integrated into UniProtKB/TrEMBL.

DT 11-JUL-2006, sequence version 1.

DT 24-JUL-2007, entry version 6.

DE 122 kDa protein TMEM16 (Fragment).

OS Strongylocentrotus purpuratus (Purple sea urchin).

OC Eukaryota; Metazoa; Echinodermata; Eleutherozoa; Echinozoa; 

OC Echinoidea; Euechinoidea; Echinacea; Echinoida; Strongylocentrotidae; 

OC Strongylocentrotus. 

OX NCBI_TaxID=7668; 

RN [1] 

RP NUCLEOTIDE SEQUENCE. 

RC TISSUE=Coelomocyte; 

RA Galindo B.E., Moy G.W., Vacquier V.D.; 

RT "A 122 kDa protein from Strongylocentrotus purpuratus embryo belongs 

RT to TMEM16 protein family."; 

RL Submitted (APR-2005) to the EMBL/GenBank/DDBJ databases. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribution-NoDerivs License 

CC 

DR EMBL; DQ020165; AAY82886.1; -; mRNA. 

DR UniGene; Spu.15325; -.

DR InterPro; IPR007632; DUF590. 

DR PANTHER; PTHR12308; DUF590; 1. 

DR Pfam; PF04547; DUF590; 1. 

PE 2: Evidence at transcript level; 

FT NON_TER 1 1 

SQ SEQUENCE 903 AA; 104887 MW; 443A4A4A7E50074A CRC64;



Query Match 30.8%; Score 1525.5; DB 2; Length 903; 

Best Local Similarity 38.1%; Pred. No. 9.3e-116; 

Matches 338; Conservative 161; Mismatches 304; Indels 85; Gaps 21; 

Qy 61 QEEDSTVLIDVSPPEAEKRGSYGSTAHASEPGGQQAAACRAGSPAK 106 

I : : : I : I I I : : I I : : I I I I I 

Db 17 QPDNAAV ANPQEIEM AESHMGAPTDKEDQPLHEGSPAKKDDNKVNDLFFKDG 68 

Qy 10 7 PRIADFVLVWEEDLKLDRQQDSAARDRTDMHRTWRETFLDNLRAAGLCVD QQDVQDG 163 

I MM: I : I : I j : : : j : j || || : : : : || 

Db 6 9 KRRIDFVLAY RKQESEEREEKRVKK — RQNFEANLIDEGLQLEYENSEESHDG 119 

Qy 16 4 NTTVHYALLSASWAVLCYYAEDLRLKLPLQELPNQASNWSAGLLAWLGIPNVLLEVVPDV 2 23 
I : : I I : : I I I : I : : I : I : : I : : I I II

12 0 RT — FFVKVHAPWDLMTRYAEELKIKMPIEENNMEEPVNVFNCIDKLWTPFELSEEYVKP 177 

22 4 PPEYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPYGHEKKNLLGIHQLLA 2 83 

I : : : I : : I : : : I I I I I : | : : : : : | | | : | | | | : | 

178 EPDVFTAPFIRDRASEFI-MESQDTFFPNNIRNRWYEILERMRYDANDPAKFGIDHLIA 236 

284 EGVLS AAFPLHDGPFKTP PEGPQAPRLNQRQVLFQHWARWGKWNKYQPLDHVRRYF 339 

I II : I I I : I : I : | | | I I : I :: I I I I : I I I I I I : I I II 

23 7 NGSYFAAYPLHEGDYKSKHSLLTHGPQ NDRHLLYEEWARPGRWYKKQPLDLIRRYF 2 92 

340 GEKVALYFAWLGFYTGWLLPAAVVGTLVFLVGCFLVFSDIPTQELCGSKDSFEMCPLC-L 398

I I I :: I I I I I I I I I I 11:11111 : I : I I : I I I I I I I 
293 GEKIGI YFCWLGF YTEMLTWAGFVGL I VFLYGCI SLPSS VWQE ICDGTDI I -MCPLCDK 351 

399 DCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSATLAYRWDCSDYE 458 

I I : I I I : I : : III: I I I I : I I : I I I : I : I I I : I : I I I : I 

352 RCPYWTLSDSCFYSKLTYLFDNEATVFFACFMSLWATMFCEFWKRRQNTIDYDWDLFGFE 411 

459 DTEERPRPQFAASAPMTAPNPITGEDEPYFPERSRARRMLAGSVVIVVMVAVWMCLVSI 518 

: I I I I : I I I I : I I I III : I I I I : : I : : : : : 

412 EQEENIRPEFEAKAPDRRVSPITNLTEPYMKFSRKFPRFSASIASIFFMILLVMAAVMTV 4 71 

519 ILYRAIMAIVVSRSGNTLLAAWASRIASLTGSWNLVFILILSKIYVSLAHVLTRWEMHR 5 78 
!:!!:: : : : : : | | | | : | | : : : | : | : || : | : | || | : || 

4 72 IVYRIVVKTAIFAIDQEFISSYASIITSVTASMISLILIMILQILYERIAVWLTNLELHR 531 

579 TQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNYHTLFGV-RNEECAAGGCL 637 

I : I : : I I : I I I : : : I I I I : I I : I I I I I I I I I I : I : I I : I III I I : 
532 TETEYEDSFTFKMYLFAFVNYYSTSFYIAFFKGRLPGTPADYGRVFGIWRQEECDPAGCM 591 

638 IELAQELLVIMVGKQVINNMQEVLIPKLKGWWQKFRLRSKKRKAGASAGASQGPWEDDYE 697

II : : I I II II I : : I 1 : I : I I : I : I III: 

592 QELFINIAITMCGKQFFNNFMELAMPVLMNFWRS-RTGRKEEKSGK GRYEQWEQDAD 6 47 

698 LVPC — EGLFDEYLEMVLQFGFVT IF VAACPLAPLFALLNNWVE IRLDARKF VCE YRRPV 755 

I III 111111:1111 I I I I I I I I I I I I I I I I I 11:1111 II: : II II 

648 LADLGPRGLFKEYLEMWQFGFSTIFVAAFPLAPLFALLNNLVEVRLDAYKFISQLRRPV 707 

756 AERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPR AYYRWTRAHDLRGFLNFTL 812 

I : I I I I I I I : I I : : I : I : : I I : : I I : I : I : I I II | | : | : : | 

708 AKRAQDIGAWYAILVTVGNLSVLTNALVIAFTSEFIPRQVFKYYYGGPEATLNGYTNWSL 767 

813 A RAPSSFAAAHNRT CRYRAFRDDDGHYSQT — YWNL 8 46 

: II I I I I I I : I I : I I I : 

768 SYFNTVDMQNDSKPTDPSYPRVGDEDTTDPNYGLNVSVCRYRGNYDE— HYNVTLDYWLV 825 

847 LAIRLAFVIVFEHWFSVGRLLDLLVPDIPESVEIKVKREYYLAKQAL 894 

: I I : I I I : : : : I I I : : : I I : I I I : : : I I I I I : I I I 

826 IAIKLAFILLYEHFVLFTKFFVAYIIPDMPEFVKNQIKRETYLGQQAL 873



RESULT 6 

TM16D_HUMAN 

ID TM16D_HUMAN 
AC Q32M45; Q8NAJ0; Q8NB39; Q8NB53;

DT 29-MAY-2007, integrated into UniProtKB/Swiss-Prot . 

DT 06-DEC-2005, sequence version 1. 

DT 21-AUG-2007, entry version 17. 

DE Transmembrane protein 16D. 

GN Name=TMEM16D; 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; 

OC Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 2 AND 3). 

RC TISSUE=Brain, and Prostate; 

RX PubMed=14702039; DOI=10.1038/ng1285;

RA Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R.,

RA Wakamatsu A., Hayashi K., Sato H., Nagai K., Kimura K., Makita H., 

RA Sekine M., Obayashi M., Nishi T., Shibahara T., Tanaka T., Ishii S., 

RA Yamamoto J., Saito K., Kawai Y., Isono Y., Nakamura Y., Nagahari K., 

RA Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M., Shiratori A., 

RA Sudo H., Hosoiri T., Kaku Y., Kodaira H., Kondo H., Sugawara M., 

RA Takahashi M., Kanda K., Yokoi T., Furuya T., Kikkawa E., Omura Y., 

RA Abe K., Kamihara K., Katsuta N., Sato K., Tanikawa M., Yamazaki M.,

RA Ninomiya K., Ishibashi T., Yamashita H., Murakawa K., Fujimori K., 

RA Tanai H., Kimata M., Watanabe M., Hiraoka S., Chiba Y., Ishida S., 

RA Ono Y., Takiguchi S., Watanabe S., Yosida M., Hotuta T., Kusano J., 

RA Kanehori K., Takahashi-Fujii A., Hara H., Tanase T.-O., Nomura Y.,

RA Togiya S., Komai F., Hara R., Takeuchi K., Arita M., Imose N., 

RA Musashino K., Yuuki H., Oshima A., Sasaki N., Aotsuka S., 

RA Yoshikawa Y., Matsunawa H., Ichihara T., Shiohata N., Sano S., 

RA Moriya S., Momiyama H., Satoh N., Takami S., Terashima Y., Suzuki O.,

RA Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F., Wakebe H., 

RA Hishigaki H., Watanabe T., Sugiyama A., Takemoto M., Kawakami B., 

RA Yamazaki M., Watanabe K., Kumagai A., Itakura S., Fukuzumi Y., 

RA Fujimori Y., Komiyama M., Tashiro H., Tanigami A., Fujiwara T., 

RA Ono T., Yamada K., Fujii Y., Ozaki K., Hirao M., Ohmori Y., 

RA Kawabata A., Hikiji T., Kobatake N., Inagaki H., Ikema Y., Okamoto S., 

RA Okitani R., Kawakami T., Noguchi S., Itoh T., Shigeta K., Senba T., 

RA Matsumura K., Nakajima Y., Mizuno T., Morinaga M., Sasaki M., 

RA Togashi T., Oyama M., Hata H., Watanabe M., Komatsu T., 

RA Mizushima-Sugano J., Satoh T., Shirai Y., Takahashi Y., Nakagawa K., 

RA Okumura K., Nagase T., Nomura N., Kikuchi H., Masuho Y., Yamashita R., 

RA Nakai K., Yada T., Nakamura Y., Ohara O., Isogai T., Sugano S.;

RT "Complete sequencing and characterization of 21,243 full-length human 

RT cDNAs.";

RL Nat. Genet. 36:40-45(2004). 

RN [2] 

RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1) . 

RX PubMed=15489334; DOI=10.1101/gr.2596504;

RG The MGC Project Team; 

RT "The status, quality, and expansion of the NIH full-length cDNA 

RT project: the Mammalian Gene Collection (MGC)."; 

RL Genome Res. 14:2121-2127(2004). 

RN [3] 

RP IDENTIFICATION. 

RX PubMed=12739008; 
RA Katoh M., Katoh M.;

RT "FLJ10261 gene, located within the CCND1-EMS1 locus on human
RT chromosome 11q13, encodes the eight-transmembrane protein homologous
RT to C12orf3, C11orf25 and FLJ34272 gene products.";
RL Int. J. Oncol. 22:1375-1381(2003).

CC -!- SUBCELLULAR LOCATION: Membrane; Multi-pass membrane protein
CC (Potential).
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=3;
CC Name=1;
CC IsoId=Q32M45-1; Sequence=Displayed;
CC Name=2;
CC IsoId=Q32M45-2; Sequence=VSP_025742;
CC Name=3;
CC IsoId=Q32M45-3; Sequence=VSP_025741, VSP_025743;
CC Note=No experimental confirmation available;
CC -!- SIMILARITY: Belongs to the TMEM16 family.



CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 
CC Distributed under the Creative Commons Attribution-NoDerivs License 



ALT_INIT; 
mRNA. 
mRNA. 
mRNA. 



mRNA. 



EMBL; AK091540; BAC03688. 
EMBL; AK091591; BAC03704. 
EMBL; AK092596; BAC03924. 
EMBL; BC109308; AAI09309. 
DR UniGene; Hs.58785; -.
DR Ensembl; ENSG00000151572; Homo sapiens.
DR HGNC; HGNC:23837; TMEM16D.
DR MIM; 610111; gene.
DR PharmGKB; PA134975112; -.
DR ArrayExpress; Q32M45; -.
DR InterPro; IPR007632; DUF590.
DR PANTHER; PTHR12308; DUF590; 1.
DR Pfam; PF04547; DUF590; 1.
PE 2: Evidence at transcript level;
KW Alternative splicing; Coiled coil; Glycoprotein;
KW Polymorphism; Transmembrane.



FT CHAIN 1 955 Transmembrane protein 16D.
FT /FTId=PRO_0000288650.
FT TRANSMEM 353 373 Potential.
FT TRANSMEM 425 445 Potential.
FT TRANSMEM 506 526 Potential.
FT TRANSMEM 548 568 Potential.
FT TRANSMEM 596 616 Potential.
FT TRANSMEM 716 736 Potential.
FT TRANSMEM 769 789 Potential.
FT TRANSMEM 886 906 Potential.
FT COILED 918 953 Potential.
FT CARBOHYD 824 824 N-linked (GlcNAc...) (Potential).
FT VAR_SEQ 1 433 Missing (in isoform 3).
FT /FTId=VSP_025741.
FT VAR_SEQ 19 54 EGGVDLQGYQLDMQILPDGPKSDVDFSEILNAIQEM
FT (in isoform 2).
FT /FTId=VSP_025742.
FT VAR_SEQ 466 512 Missing (in isoform 3).
FT /FTId=VSP_025743.
FT VARIANT 115 115 G -> A (in dbSNP:rs34162417).
FT /FTId=VAR_032453.
FT CONFLICT 209 209 F -> L (in Ref. 1; BAC03704).

SQ SEQUENCE 955 AA; 111462 MW; 9A9348C61A4F20AF CRC64; 

Query Match 30.8%; Score 1525.5; DB 1; Length 955; 

Best Local Similarity 38.2%; Pred. No. 1e-115;

Matches 355; Conservative 163; Mismatches 309; Indels 103; Gaps 28; 

Qy 63 EDSTVLID VSPPEAE KRGSYGST AHASEPGGQQAAACRAGS PA 105 

: I : I I III: I : 1 I I III: : 

Db 56 KDVNILFDELEAVSSPCKDDDSLLHPGNLTSTSDDASRLEAGGETVPERNKSNGLYFRDG 115 

Qy 106 KPRIADFVLVWEEDLKLDRQQDSAARDRTDMHRTWRETFLDNLRAAGLCVDQQDVQDGNT 165 

I I I I : : I I : : : : I : III I : I I I I : : : : I : 

Db 116 KCRI-DYILVYRK SNPQTEK REVFERNIRAEGLQMEKESSLI-NS 158 

Qy 166 TVHYALLSASWAVLCYYAEDLRLKLPLQE LPNQASNWS AGLLAWLGIPNV 215 

: : I I I I I I I I : : : : I : I I : I : II : 

Db 159 DIIFVKLHAPWEVLGRYAEQMNVRMPFRRKIYYLPRRYKFMSRIDKQISRFRRWLPKKPM 218 

216 LL--EVVPDVPP-EYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPYGHEK 272

I I : I I : : I : I : : I : I : : I I I : I : I : I I : I I
219 RLDKETLPDLEENDCYTAPFSQQRIHHFI-IHNKETFFNNATRSRIVHHILQRIKY-EEG 276

273 KNLLGIHQLLAEGVLSAAFPLHDGPFKTPPEGPQAPRLNQRQVLFQHWARWGKWNKYQPL 332

I I : I : : : I I I I I I I I I : I : : : I I : I :: I I I I I I I I I I

277 KNKIGLNRLLTNGSYEAAFPLHEGSYRSKNSIRTHGAENHRHLLYECWASWGVWYKYQPL 336

333 DHVRRYFGEKVALYFAWLGFYTGWLLPAAVVGTLVFLVGCFLVFSDIPTQELCGSKDSFE 392

I I I I I I I I I : I I I I I I I : I I I I I I I : I I I I I : : : I : I : I

337 DLVRRYFGEKIGLYFAWLGWYTGMLFPAAFIGLFVFLYGVTTLDHSQVSKEVCQATDII- 395

393 MCPLC-LDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSATLAYR 451 

I I I : I III I I : I I : I I I : I I I I I : : I I I : I I : I I : I I I : I : I I 
396 MCPVCDKYCPFMRLSDSCVYAKVTHLFDNGATVFFAVFMAVWATVFLEFWKRRRAVIAYD 455 

452 WDCSDYEDTEERPRPQFAAS-APMTAPNPITGEDEPYFPERSRARRMLAGSVVIVVMVAV 510 

II I : I : I I I I I I I : I I I : I : I I I : I : : : I I : I 
456 WDLIDWEEEEEEIRPQFEAKYSKKERMNPISGKPEPYQAFTDKCSRLIVSASGIFFMICV 515 

511 VVMCLVSIILYRAIMAIVVSRSGNTLLA-AWA SRIASLTGSVV — NLVFILILSK 562 

I : : I : : I I : : I I i I | : : | : | | : | | | : : | : 

516 VIAAVFGIVIYRVVTV STFAAFKWALIRNNSQVAT-TGTAVCINFCIIMLLNV 567 

563 IYVSLAHVLTRWEMHRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNYHT 622

: I : I : I I I I I : : : : I : : I I I I : I : I I I I I II I I I I I I I I I : I I I 
568 LYEKVALLLTNLEQPRTESEWENSFTLKMFLFQFVNLNSSTFYIAFFLGRFTGHPGAYLR 62 7 

623 LFG-VRNEECAAGGCLIELAQELLVIMVGKQVINNMQEVLIPKLKGWWQKFRLRSKKRKA 681 
I I I I I I I I I : I : : : I I I I I II I : I : : I I : I : : : 

62 8 LINRWRLEECHPSGCLIDLCMQMGIIMVLKQTWNNFMELGYPLIQNWWTR RKVRQEH 684 

682 GASAGASQGPWEDDYELVPCE — GLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNNWVE 739 
I I M I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I : I 
685 GPERKISFPQWEKDYNLQPMNAYGLFDEYLEMILQFGFTTIFVAAFPLAPLLALLNNIIE 744

74 0 IRLDARKFVCEYRRPVAERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRAYYRW- 7 98 

I I I M III : : I I I : I I I : I I I I I : II I : I : I I : I I I : : I : I I I : II I : 
745 IRLDAYKFVTQWRRPLASRAKDIGIWYGILEGIGILSVITNAFVIAITSDFIPRLVYAYK 804 

799 TRAHDLRGFLNFTLA RAPSSFAAAHNRTCRYRAFRDD 835 

: I : : I : I : I I : : I I I I : I I 

805 YGPCAGQGEAGQKCMVGYVNASLSVFRISDFENRSEPESDGSEFSGTPLKYCRYRDYRDP 864 

836 DGH YSQTYWNLLAIRLAFVIVFEHWFSVGRLLDLLVPDIPESVEIKVKREYYLA 890 

I : : I : : M M I I : I I I I I : I I : I : I : I I : I : : : : : II II 
86 5 PHSLVPYGYTLQFWHVLAARLAFIIVFEHLVFCIKHLISYLIPDLPKDLRDRMRREKYLI 924 

891 KQALAENEVLFGTNGTKDEQPKGSELSSHW 920 

: : : I I : I : : I : I 

925 QEMMYEAELERLQKERKERKKNGKAHHNEW 954 



RESULT 7 
Q1AP35_STRPU 

ID Q1AP35_STRPU Unreviewed; 921 AA. 

AC Q1AP35; 

DT 11-JUL-2006, integrated into UniProtKB/TrEMBL.

DT 11-JUL-2006, sequence version 1.

DT 24-JUL-2007, entry version 6.

DE 122 kDa protein TMEM16.

OS Strongylocentrotus purpuratus (Purple sea urchin).

OC Eukaryota; Metazoa; Echinodermata; Eleutherozoa; Echinozoa; 

OC Echinoidea; Euechinoidea; Echinacea; Echinoida; Strongylocentrotidae; 

OC Strongylocentrotus. 

OX NCBI_TaxID=7668; 

RN [1] 

RP NUCLEOTIDE SEQUENCE. 

RC TISSUE=Testis; 

RA Galindo B.E., Vacquier V.D.; 

RT "Phylogeny of the TMEM16 Protein Family: Some Are Over Expressed in 

RT Cancer."; 

RL Submitted (APR-2005) to the EMBL/GenBank/DDBJ databases. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribution-NoDerivs License 

CC 

DR EMBL; DQ020166; AAY82887.1; -; mRNA. 

DR UniGene; Spu.15325; -.

DR InterPro; IPR007632; DUF590. 

DR PANTHER; PTHR12308; DUF590; 1. 

DR Pfam; PF04547; DUF590; 1. 

PE 2: Evidence at transcript level; 

SQ SEQUENCE 921 AA; 106711 MW; C3F912A32528F059 CRC64;

Query Match 30.8%; Score 1523; DB 2; Length 921; 

Best Local Similarity 37.8%; Pred. No. 1.5e-115; 

Matches 339; Conservative 159; Mismatches 305; Indels 94; Gaps 21; 
61 QEEDSTVLIDVSPPEAEKRGSYGSTAHASEPGGQQAAACRAGSPAK 106

I : : : I : I I I : : I I : : I I I I I 

26 QPDNAAV ANPQEIEM AESHMGAPTDKEDQPLHEGSPAKKDDNKVNDLFFKDG 7 7 

10 7 PRIADFVLVWEEDLKLDRQQDSAARDRTDMHRTWRETFLDNLRAAGLCV 155 

I MM: M I M M : : Ml M M : 

7 8 KRRIDFVLAY RKQESEEREEKRVKK — RQNFEANLIDEGLQLEYENSEGPEPK 128 

156 -DQQDVQDGNTTVHYALLSASWAVLCYYAEDLRLKLPLQELPNQASNWSAGLLAWLGIPN 214 

I : Ml : : I I : : I I I : I : : |: I : : I : : I I 

12 9 EDDPESHDGRT — FFVKVHAPWDLMTRYAEELKIKMPIEENNMEEPVNVFNCIDKLWTPF 186 

215 VLLEVVPDVPPEYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPYGHEKKN 2 74 

II M : : I : : I : : M II II : I : : : : M II : I 

18 7 ELSEEYVKPEPDVFTAPFIRDRASEFI-MESQDTFFPNNIRNRVVYEILERMRYDANDPA 2 45 

2 75 LLGIHQLLAEGVLSAAFPLHDGPFKTP PEGPQAPRLNQRQVLFQHWARWGKWNKYQ 330 

M I: I I I M II I: I : I: III I I M : : I I I M I I I 
246 KFGIDHLIANGSYFAAYPLHEGDYKSKHSLLTHGPQ NDRHLLYEEWARPGRWYKKQ 301 

331 PLDHVRRYFGEKVALYFAWLGFYTGWLLPAAWGTLVFLVGCFLVFSDIPTQELCGSKDS 390 

II I : I I I I I I I: : I I I I I I I I I I I I : I I I I I : I : I M I I 

302 PLDLIRRYFGEKIGIYFCWLGFYTEMLTWAGFVGLIVFLYGCISLPSSVWQEICAGTDI 361 

391 FEMCPLC-LDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSATLA 449 

II II I M : I I I : I : : MM I I I M I M I II : M II M M 
362 I-MCPLCDKRCPYWTLSDSCFYSKLTYLFDNEATVFFACFMSLWATMFCEFWKRRQNTID 42 0 

450 YRWDCSDYEDTEERPRPQFAASAPMTAPNPITGEDEPYFPERSRARRMLAGSWIWMVA 509 

III : I: I I I I: I I I I MM III : I I I M 

421 YDWDLFGFEEQEENIRPEFEAKAPDRRVSPITNLTEPYMKFSRKFPRFSASIASIFFMIL 480 

510 VVVMCLVSIILYRAIMAIVVSRSGNTLLAAWASRIASLTGSVVNLVFILILSKIYVSLAH 569 

M : : : : M M I : : : : : : M I I M I I : : M : M II M M 

481 LVMAAVMTVIVYRIVVKTAIFAIDQEFISSYASIITSVTASMISLILIMILQILYERIAV 5 40 

57 0 VLTRWEMHRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNYHTLFGV-RN 62 8 

II M II M I: M M II I: : M I I M I M II I II I II I I M MM I 
541 RLTNLELHRTETEYEDSFTFKMYLFAFVNYYSTSFYIAFFKGRLPGTPADYGRVFGIWRQ 600 

62 9 EECAAGGCLIELAQELLVIMVGKQVINNMQEVLIPKLKGWWQKFRLRSKKRKAGASAGAS 688 
Ml I I: I I : : I I I I II M M I M : I M M I 

601 EECDPAGCMQELFINIAITMCGKQFFNNFMELAMPVLMNFWRS-RTGRKEEKSGK GR 6 56 

6 8 9 QGPWEDDYELVPC — EGLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNNWVEIRLDARK 7 46 

II I M III II I II I M II I I II II I I I II II I II II I M II II I 

657 YEQWEQDADLADLGPRGLFKEYLEMVVQFGFSTIFVAAFPLAPLFALLNNLVEVRLDAYK 716 

74 7 FVCEYRRPVAERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPR AYYRWTRAHD 803 

I: : I I II M I II I II I: II : M M : M I : M M I M M I II 
717 FISQLRRPVAKRAQDIGAWYAILVTVGNLSVLTNALVIAFTSEFIPRQVFKYYYGGPEAT 7 76 

804 LRGFLNFTLA RAPSSFAAAHNRT CRYRAFRDDDGHY 839 

I I: I: M : II I I I I I MM 

77 7 LNGYTNWSLSYFNTVDMQNDSKPTDPSYPRVGDEDTTDPNYGLNVSVCRYRGNYDE — HY 834 
Qy 840 SQT--YWNLLAIRLAFVIVFEHVVFSVGRLLDLLVPDIPESVEIKVKREYYLAKQAL 894

: I I I : : I I : I I I : : : : I I I : : : | | : | | | : : : | | | | | : | | | 

Db 835 NVTLDYWLVIAIKLAFILLYEHFVLFTKFFVAYIIPDMPEFVKNQIKRETYLGQQAL 891 



RESULT 8
A2AHE5_MOUSE

ID A2AHE5_MOUSE Unreviewed; 981 AA.

AC A2AHE5; 

DT 20-FEB-2007, integrated into UniProtKB/TrEMBL . 

DT 20-FEB-2007, sequence version 1. 

DT 21-AUG-2007, entry version 6. 

DE Novel protein (B230324K02Rik) . 

GN Name=RP23-300M9.6; ORFNames=RP23-300M9.6-002;

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; 

OC Muroidea; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP NUCLEOTIDE SEQUENCE. 

RA Smith M.;

RL Submitted (JUL-2007) to the EMBL/GenBank/DDBJ databases. 

RN [2] 

RP NUCLEOTIDE SEQUENCE. 

RA Kay M.;

RL Submitted (JUL-2007) to the EMBL/GenBank/DDBJ databases. 



RN [3]
RP NUCLEOTIDE SEQUENCE.
RA Phillimore B.;
RL Submitted (JUL-2007) to the EMBL/GenBank/DDBJ databases.
CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution-NoDerivs License
DR EMBL; AL731700; CAM18325.1; -; Genomic_DNA.
DR EMBL; AL731779; CAM18325.1; JOINED; Genomic_DNA.
DR EMBL; BX005257; CAM18325.1; JOINED; Genomic_DNA.
DR EMBL; AL731779; CAM25697.1; -; Genomic_DNA.
DR EMBL; AL731700; CAM25697.1; JOINED; Genomic_DNA.
DR EMBL; BX005257; CAM25697.1; JOINED; Genomic_DNA.
DR EMBL; BX005257; CAM27817.1; -; Genomic_DNA.
DR EMBL; AL731700; CAM27817.1; JOINED; Genomic_DNA.
DR EMBL; AL731779; CAM27817.1; JOINED; Genomic_DNA.
DR InterPro; IPR007632; DUF590.
DR PANTHER; PTHR12308; DUF590; 1.
DR Pfam; PF04547; DUF590; 1.
PE 4: Predicted;
SQ SEQUENCE 981 AA; 114568 MW; E851D80BC49F4977 CRC64;



Query Match 30.6%; Score 1512.5; DB 2; Length 981; 

Best Local Similarity 38.9%; Pred. No. 1.2e-114; 

Matches 333; Conservative 166; Mismatches 279; Indels 79; Gaps 



106 KPRI ADFVLVWEEDLKLDRQQDSAARDRTDMHRTWRETFLDNLRAAGLCVDQQDVQDGNT 165 
I I I I : : I I : : I : : III I I I I I I : : : : I

161 KKRI-DYILVYR KTNIQYDKRNTFEKNLRAEGLMLEKEPA-I ANP 2 03 

166 TVHYALLSASWAVLCYYAEDLRLKLPLQ ELPNQASNWSAGLLAWLGIPNV 215 

: : : I I I I I I I : : : I : : : : | : : | : : 

204 DIMFIKIHIPWDTLCKYAERLNIRVPFRKKCYYTDQKNKSKSRVQNYFKRIKKWMSQNPM 263 

216 LLE — WPDV-PPEYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPY — GH 2 70 
: I : I : : : I : I : : I : : I : I I I I : : I : I : : : I : I I | 

264 VLDKSAFPELEESDCYTGPFSRARIHHFI-INNKDTFFSNATRSRIVYHMLERTKYENGI 322 

2 71 EKKNLLGIHQLLAEGVLSAAFPLHDGPFKT PPEGPQAPRLNQRQVLFQHWARWGKW 326 

I : I I : I : I I I I I I : I : I : III I I : I :: I I I I I I 

323 SK VGIRKLITNGSYIAAFPPHEGAYKSSLPIKTHGPQ NNRHLLYERWARWGMW 3 75 

32 7 NKYQPLDHVRRYFGEKVALYFAWLGFYTGWLLPAAVVGTLVFLVGCFLVFSDIPTQELCG 3 86 
I : I I I I : I I I I I I : I I M I I I : I I I I : I I I I I I II I : : I I : I 

3 76 YKHQP LDL I RMYF GEK I GL YF AWL GWYTGML I P AAVVGL C VFF YGL VTMNE S QVS QE I CK 435 

38 7 SKDSFEMCPLC-LDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKS 445 

:: I I I I I I : I I : : I I : I I I : I I I I I I : : I I I : I I : I I : I I I : 

436 ATEVF-MCPLCDKNCSLQRLNDSCIYAKVTYLFDNGGTVFFAIFMAIWATVFLEFWKRRR 494 

446 ATLAYRWDCSDYEDTEERPRPQFAAS-APMTAPNPITGEDEPYFPERSRARRMLAGSVVI 504 

: I I I I : : I : I I I I I I I I II I I I : I I : I : I : I I 

495 SILTYTWDLIEWEEEEETLRPQFEAKYYRMEVINPITGKPEPHQPSSDKVTRLLVSVSGI 554 

505 WMVAVVVMCLVSIILYR-AIMAIWSRSGNTLLAAWASRIASLTGSV-VNLVFILILSK 562 

I : : : I : : : : : : I I : I I I : I : I : : I : I : I : : I : 

555 FFMISLVITAVFAVWYRLWMEQFASFKWNFVKQHW — QFATSGAAVCINFIIIMLLNL 612 

563 IYVSLAHVLTRWEMHRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNYHT 622
I : I : : II I I I : : : : I : : I I I : I : I I I I I II I I I I I I I I I I : I I I : 

613 AYEKIAYLLTNLEYPRTESEWENSFALKMFLFQFVNLNSSIFYIAFFLGRFVGHPGKYNK 6 72 

62 3 LF-GVRNEECAAGGCLIELAQELLVIMVGKQVINNMQEVLIPKLKGWWQKFRLRSKKRKA 6 81 

M I I I I I I I I : I : : I I I I I : I I I : I : : I I : : : I 
673 LFERWRLEECHPSGCLIDLCLQMGVIMFLKQIWNNFMELGYPLIQNWWSRHKI KR 727 

682 GAS AGASQGPWEDDYELVP — CEGLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNNWVE 739 

I II I I : I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I 

72 8 GIQ-DASIPQWENDWNLQPMNIHGLMDEYLEMVLQFGFTTIFVAAFPLAPLLALLNNIIE 786 

74 0 IRLDARKFVCEYRRPVAERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRAYYRW- 7 98 
I I I I I III : : I I I : II I I I I I II I : I I I I : I I I : : I : I I : : I I I : 

78 7 IRLDAYKFVTQWRRPLPARATDIGIWLGILEGIGILAVITNAFVIAITSDYIPRFVYEYK 8 46 

79 9 TRAHDLRGFLNFTLARAP-SSFAAAHNRTCRYRAFR DDDGHYSQTY 8 43 

: I : I : : I : I : I : I I I I : I : : I 

84 7 YGPCANHVKQNENCLKGYVNNSLSFFDLSELGMGKSGYCRYRDYRGPPWSSKPYEFTLQY 906 

844 WNLLAIRLAFVIVFEHWFSVGRLLDLLVPDIPESVEIKVKREYYLAKQALAENEVLFGT 903 
I : : I I I I I I : I I I I I : I I : : I : I I I I : : : : : | | | | : : : | | : 

90 7 WHILAARLAFIIVFEHLVFGIKSFIAYLIPDIPKGLRERIRREKYLVQEMMYEAEL E 963 

904 NGTKDEQPKGSELSSHW 920 
: : : I : I

Db 964 HLQQQRRKSGQPIHHEW 980 



RESULT 9
TM16C_HUMAN

ID TM16C_HUMAN Reviewed; 981 AA. 

AC Q9BYT9; 

DT 16-JAN-2004, integrated into UniProtKB/Swiss-Prot . 

DT 01-JUN-2001, sequence version 1.

DT 24-JUL-2007, entry version 31. 

DE Transmembrane protein 16C. 

GN Name=TMEM16C; Synonyms=C11orf25; ORFNames=GENX-3947;

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; 

OC Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP NUCLEOTIDE SEQUENCE [MRNA] . 

RA Rosier M.F., Toselli E., Segurens-Soury B., Auffray C, Devignes M.D.; 

RT "Predominant brain expression and full-length characterization of a 

RT novel human 6.6-Kb transcript mapping at 11p14 in the telomeric part

RT of WAGR locus.";

RL Submitted (NOV-2000) to the EMBL/GenBank/DDBJ databases. 

CC -!- SUBCELLULAR LOCATION: Membrane; Multi-pass membrane protein 

CC (Probable) . 

CC -!- SIMILARITY: Belongs to the TMEM16 family. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribution-NoDerivs License 

CC 

DR EMBL; AJ300461; CAC32454.1; -; mRNA. 

DR UniGene; Hs.91791; -.

DR Ensembl; ENSG00000134343; Homo sapiens. 

DR KEGG; hsa:63982; -. 

DR HGNC; HGNC:14004; TMEM16C.

DR MIM; 610110; gene. 

DR PharmGKB; PA25489; -. 

DR ArrayExpress; Q9BYT9; -. 

DR InterPro; IPR007632; DUF590. 

DR PANTHER; PTHR12308; DUF590; 1. 

DR Pfam; PF04547; DUF590; 1. 

PE 2: Evidence at transcript level; 

KW Membrane; Transmembrane. 



FT CHAIN 1 981 Transmembrane protein 16C.
FT /FTId=PRO_0000072565.
FT TRANSMEM 398 420 Potential.
FT TRANSMEM 471 490 Potential.
FT TRANSMEM 553 575 Potential.
FT TRANSMEM 590 612 Potential.
FT TRANSMEM 642 664 Potential.
FT TRANSMEM 759 781 Potential.
FT TRANSMEM 809 831 Potential.
FT TRANSMEM 904 926 Potential.
SQ SEQUENCE 981 AA; 114655 MW; 15A3276420912393 CRC64;

Query Match 30.4%; Score 1504; DB 1; Length 981; 

Best Local Similarity 39.4%; Pred. No. 6.1e-114; 

Matches 329; Conservative 163; Mismatches 268; Indels 76; Gaps 24; 

Qy 106 KPRIADFVLVWEEDLKLDRQQDSAARDRTDMHRTWRETFLDNLRAAGLCVDQQDVQDGNT 165

I I I I : : I I : : I : : III I I I I I I : : : : :

Db 161 KRRI-DYILVYR KTNIPYDKRNTFEKNLRAEGLMLEKEPA-IASP 203

Qy 166 TVHYALLSASWAVLCYYAEDLRLKLPLQ ELPNQASNWSAGLLAWLGIPNV 215

: : : | | | | | | | : : : | : : : : : | : :

Db 204 DIMFIKIHIPWDTLCKYAERLNIRMPFRKKCYYTDGRSKSMGRMQTYFRRIKDWMAQNPM 263

Qy 216 LLE--VVPDV-PPEYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPY--GH 270

: I : II: : I : I : : I : : I : I I I I : : I : I : : : I : I I I
Db 264 VLDKSAFPDLEESDCYTGPFSRARIHHFI-INNKDTFFSNATRSRIVYHMLERTKYENGI 322

Qy 271 EKKNLLGIHQLLAEGVLSAAFPLHDGPFKT PPEGPQAPRLNQRQVLFQHWARWGKW 326

I : I I : I : I I I I I I : I : I : III I I : I :: I I I I I I

Db 323 SK VGIRKLINNGSYIAAFPPHEGAYKSSQPIKTHGPQ NNRHLLYERWARWGMW 375

32 7 NKYQPLDHVRRYFGEKVALYFAWLGFYTGWLLPAAVVGTLVFLVGCFLVFSDIPTQELCG 386 

I : I I I I : I I I I I I : I I I I I I I : I I I I : I I I : I I II I I : : : I I : I 
376 YKHQP LDL I RL YF GEK I GL YF AWL GWYTGML I P AAI VGL C VFF YGLF TMNNS QVS QE I CK 435 

38 7 SKDSFEMCPLC-LDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKS 4 45 

:: I I I I I I : I I : : I I : I I I : I I I I I I : : I I I : I I : I I : I II : 

436 ATEVF-MCPLCDKNCSLQRLNDSCIYAKVTYLFDNGGTVFFAIFMAIWATVFLEFWKRRR 494 

446 ATLAYRWDCSDYEDTEERPRPQFAAS-APMTAPNPITGEDEPYFPERSRARRMLAGSWI 504 

: I I I I : : I : I I I I I I I I I I I I I : I I : I : I : I I 

495 SILTYTWDLIEWEEEEETLRPQFEAKYYKMEIVNPITGKPEPHQPSSDKVTRLLVSVSGI 554 

505 VVMVAVVVMCL VS I I L YR-AIMAI WSRSGNTLL AAWASRI ASLTGS V- VNL VF ILILSK 562 

!:::!: : : : : II : I I I : I : I : : | : | : | : : | : 

555 FFMISLVITAVFGVWYRLWMEQFASFKWNFIKQYW — QFATSAAAVCINFIIIMLLNL 612 

563 IYVSLAHVLTRWEMHRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNYHT 622

I : I : : I I I I I : : : : I : : I I I : I : I I I I I II I I I I I I I I I I : I I I : 
613 AYEKIAYLLTNLEYPRTESEWENSFALKMFLFQFVNLNSSIFYIAFFLGRFVGHPGKYNK 6 72 

62 3 LFG-VRNEECAAGGCLIELAQELLVIMVGKQVINNMQEVLIPKLKGWWQKFRLRSKKRKA 6 81 

II I I I I I I I I : I : : I I I II : I I I : I : : I I : : : I 
6 73 LFDRWRLEECHPSGCLIDLCLQMGVIMFLKQIWNNFMELGYPLIQNWWSRHKI KR 72 7 

682 GASAGASQGPWEDDYELVP— CEGLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNNWVE 739 

I II I I : I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I 

72 8 GIH-DASIPQWENDWNLQPMNLHGLMDEYLEMVLQFGFTTIFVAAFPLAPLLALLNNIIE 786 

740 IRLDARKFVCEYRRPVAERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRAYYRW- 798 
I I I I I III : : I I I : II I I I I I II I : I I I I : I I I : : I : I I : : I I I : 

78 7 IRLDAYKFVTQWRRPLPARATDIGIWLGILEGIGILAVITNAFVIAITSDYIPRFVYEYK 846 

799 TRAHDLRGFLNFTLARAP-SSFAAAHNRTCRYRAFR DDDGHYSQTY 843 

: I : I : : I : I : I : I I I I : I : : I 
Db 847 YGPCANHVEPSENCLKGYVNNSLSFFDLSELGMGKSGYCRYRDYRGPPWSSKPYEFTLQY 906

Qy 844 WNLLAIRLAFVIVFEHVVFSVGRLLDLLVPDIPESVEIKVKREYYLAKQALAENEV 899 

I : : M I I I I : I I I I I : I I : : I : I I : I : : : : : | | | | : : : | | : 
Db 907 WHILAARLAFIIVFEHLVFGIKSFIAYLIPDVPKGLHDRIRREKYLVQEMMYEAEL 962 



RESULT 10 

A1A5Z4_DANRE 

ID A1A5Z4_DANRE Unreviewed; 900 AA. 

AC A1A5Z4; 

DT 23-JAN-2007, integrated into UniProtKB/TrEMBL . 

DT 23-JAN-2007, sequence version 1. 

DT 24-JUL-2007, entry version 6. 

DE Zgc:158430. 

GN ORFNames=zgc:158430;

OS Danio rerio (Zebrafish) (Brachydanio rerio) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Cypriniformes;

OC Cyprinidae; Danio. 

OX NCBI_TaxID=7955; 

RN [1] 

RP NUCLEOTIDE SEQUENCE. 

RC TISSUE=Kidney; 

RX MEDLINE=22388257; PubMed=12477932; DOI=10.1073/pnas.242603899;

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F., 

RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,

RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A., 

RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A., 

RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M., 

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.; 

RT "Generation and initial analysis of more than 15,000 full-length human 

RT and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN [2] 

RP NUCLEOTIDE SEQUENCE. 

RC TISSUE=Kidney; 

RG NIH MGC Project; 

RL Submitted (DEC-2006) to the EMBL/GenBank/DDBJ databases. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribution-NoDerivs License 

CC 

DR EMBL; BC128870; AAI28871.1; -; mRNA. 
DR UniGene; Dr.108272; -.

DR ZFIN; ZDB-GENE-061215-108; zgc:158430.

DR InterPro; IPR007632; DUF590.

DR PANTHER; PTHR12308; DUF590; 1.

DR Pfam; PF04547; DUF590; 1.

PE 2: Evidence at transcript level;

SQ SEQUENCE 900 AA; 105263 MW; DB86567CCE3D153E CRC64;

Query Match 30.3%; Score 1500.5; DB 2; Length 900; 

Best Local Similarity 38.0%; Pred. No. 1.1e-113;

Matches 323; Conservative 170; Mismatches 266; Indels 91; Gaps 20; 

Qy 108 RIADFVLVWEEDLKLDRQQDSAARDRTDMHRTWRETFLDNLRAAGLCVDQQDVQDG-NTT 166

I I I I I : : I : I : : : I I II | | | : : : | : : 

Db 49 RRIDFVLSYVDDKEGDKKAER RREFEANLEKAGLELETEDKSESDDRK 96

167 VHYALLSASWAVLCYYAEDLRLKLPL QELPNQASNWSAGLLAWLGIPNVLLE 218

M : I I I I I I : I : : I : I : I : I III I II 

97 THYLKIHAPWEVLATYADVLKIKVPFKVSDIPKAREVP LEWLSHPFRLPE 146

219 VVPDVPPEYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPYGHE KKNL 275

: I : I : : I : I : II I : : I I I I : I : : I : : I I : I I I : I : 
147 NIMRPEPDYFTAPFDKSKVDFFL-IDDKDTFFPPSTRNRIVYYILTRCPYYKEDRKEKDK 205

276 LGIHQLLAEGVLSAAFPLHDGPFKTPPEGPQAPRLNQRQVLFQHWARWGKWNKYQPLDHV 335

I I : : I I I : : I : I I I I : : I : : I I : : : I I I : : I I I I : : 
206 TGINRLLNNGTYTSAYPLHDCRYWKKAQDMQCE--SERYHLYRYWARFLCFYKEQPLNLI 263

336 RRYFGEKVALYFAWLGFYTGWLLPAAVVGTLVFLVGCFLVFSDIPTQELCGSK--DSFEM 393

: : I : I I I : : I I I I I I I I I I I I I : I : I : I : I : : I : I I I 

264 KKYYGEKIGIYFAWLGFYTEMLFYAAVMGVICFVYGVLSYEDNITSKEICDPKIGGMIVM 323

394 CPLC-LDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSATLAYRW 452

I I I I I : I I : I I : I I I : I I I I I : : I I : I I I I : I I I : I I I I 

324 CPLCDKKCSYWKLNSTCLSSWQSHLFDNEGTVFFAMFMGIWVTLFLEFWKRRQARLEYEW 383

453 DCSDYEDTEE--RPRPQFAASAPMTAPNPITGEDEPYFPERSRARRMLAGSVVIVVMVAV 510

I I : I : : : : I I : : I I I I I I I I I : I : : : 

384 DLVDFEEEQQQLQIRPEYEQKCTGRRLNRITQEMEPYLPFPSKCARFCLSGATVLFWTCL 443 

511 VVMCLVSIILYR AIMAI VVSRSGNTLLAAWASRI ASLTGSVVNLVFILILS 561

: I I : : : I I I I I I : I : : II I : I I : I I I I I I : 

444 IVACIMGVIAYRLAVYAAFASVMKDSSTSKIQLVGSLITPQLATSVTASCINFVIILILN 503

562 KIYVSLAHVLTRWEMHRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNYH 621

: I : I : I I : : I : : I : I : I : I : I I I I I : I I I I : I I I I I : I I I I I I I I 
504 FLYEHVAIWITDMEIPKTHLEYENKLTMKMFMFQFVNYYSSCFYVAFFKGKFVGYPGNYS 563 

622 TLFG VRNEECAAGGCLIELAQELLVIMVGKQVINNMQEVLIPKLKGWWQKFRLRSK 677

: I I : I I I I I I I I I I I I I : I I : : I III:: I : I I I : I : : II 
564 YMFGKWSTLRNEECAPGGCLIELTTQLLIVMAGKQMVGNVQEALLPLVRNWW S 616 

678 KRKAGASAGASQGPWEDDYELVPCE--GLFDEYLEMVLQFGFVTIFVAACPLAPLFALLN 735

II : : : I I I : : I I I I I I I I I I : I I I I : I : I I I : I I I I I I I I 

617 SRKGRSHPESTYSRWEQDHDLQNFSQFGLFYEYLEMVIQFGFITLFVASFPLAPLLALFN 676
Qy 736 NWVEIRLDARKFVCEYRRPVAERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPR-- 793

Db 677 NILEVRVDAWKFTTQFRRPMAAKARNIGAWEEILNVVAIMSVVTNAFIMAFTSDMIPRLV 736

Qy 794 AYYRWTRAHDLRGFLNFTLA RAPSSFAAAHNRTCRYRAFRD 834

I I : I : : I : : : i : Mi:: I I I I I : I

Db 737 YLYAYHPGIEA-NMTGYITNSLSIYNISQIPEDNLPEAGENPSWFNSSTITTCRYRDYRY 795

Qy 835 DDGH YSQT--YWNLLAIRLAFVIVFEHVVFSVGRLLDLLVPDIPESVEIKVKREYYL 889

Db 796 PPGHLRQYTHTMQFWHILAAKLAFIIIMEHVVFVVKFFVAWLIPDVPSEVKARIKRERFL 855

Qy 890 AKQALAENEV 899

: : I M

Db 856 VQEYLHNYEV 865



RESULT 11 

Q1AP37_STRPU 

ID Q1AP37_STRPU Unreviewed; 921 AA.



AC Q1AP37; 

DT 11-JUL-2006, integrated into UniProtKB/TrEMBL.

DT 11-JUL-2006, sequence version 1.

DT 24-JUL-2007, entry version 6. 

DE 122 kDa protein TMEM16. 

OS Strongylocentrotus purpuratus (Purple sea urchin).

OC Eukaryota; Metazoa; Echinodermata; Eleutherozoa; Echinozoa; 

OC Echinoidea; Euechinoidea; Echinacea; Echinoida; Strongylocentrotidae; 

OC Strongylocentrotus. 

OX NCBI_TaxID=7668; 

RN [1] 

RP NUCLEOTIDE SEQUENCE. 

RA Galindo B.E., Moy G.W., Vacquier V.D.; 

RT "A 122 kDa protein from Strongylocentrotus purpuratus embryo belongs 

RT to TMEM16 protein family."; 

RL Submitted (APR-2005) to the EMBL/GenBank/DDBJ databases. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribution-NoDerivs License 

CC 

DR EMBL; DQ020164; AAY82885.1; -; mRNA. 

DR UniGene; Spu.15325; -.

DR InterPro; IPR007632; DUF590. 

DR PANTHER; PTHR12308; DUF590; 1. 

DR Pfam; PF04547; DUF590; 1. 

PE 2: Evidence at transcript level; 

SQ SEQUENCE 921 AA; 106699 MW; 9A516A5DCA1ADCCC CRC64; 

Query Match 30.2%; Score 1497; DB 2; Length 921; 

Best Local Similarity 38.6%; Pred. No. 2.1e-113; 

Matches 330; Conservative 149; Mismatches 291; Indels 86; Gaps 19; 

Qy 102 GSPAK PRIADFVLVWEEDLKLDRQQDSAARDRTDMHRTWRETFLDN 147 

I I I I I I I I I I : I : I : I I : : : | : | | 

Db 59 GSPAKKDDNKVNDLFFKDGKRRIDFVLAY RKQESEEREEKRVKK--RQNFEAN 109
148 LRAAGLCV DQQDVQDGNTTVHYALLSASWAVLCYYAEDLRLKLPLQEL 195

I II: I : III : : I I : : I I I : I : : I : I : : I 

110 LIDEGLQLEYENSEGPEPKEDDPESHDGRT--FFVKVHAPWDLMTRYAEELKIKMPIEEN 167

196 PNQASNWSAGLLAWLGIPNVLLEVVPDVPPEYYSCRFRVNKLPRFLGSDNQDTFFTSTKR 255

: : I I I I I : : : I : : | : : : | || || : | 

168 NMEEPVNVFNCIDKLWTPFELSEEYVKPEPDVFTAPFIRDRASEFI-MESQDTFFPNNIR 226 

256 HQILFEILAKTPYGHEKKNLLGIHQLLAEGVLSAAFPLHDGPFKTP PEGPQAPRLN 311 

: : : : : I I I : I II I : I I I I : I I I : I : I : III I 

227 NRVVYEILERMRYDANDPTKFGIDHLIANGSYFAAYPLHEGDYKSKHSLLTHGPQ N 282

312 QRQVLFQHWARWGKWNKYQPLDHVRRYFGEKVALYFAWLGFYTGWLLPAAVVGTLVFLVG 371

I : I :: I I I I : I I I I I I : I I I I I I :: I I I I I I I I I I I I : I I I I 
283 DRHLLYEEWARSGRWYKKQPLDLIRCYFGEKIGIYFCWLGFYTEMLTWAGFVGLIVFLYG 342

372 CFLVFSDIPTQELCGSKDSFEMCPLC-LDCPFWLLSSACALAQAGRLFDHGGTVFFSLFM 430

I : I : I I : I I I I I I I I I : I I I : I : : III: I I I I : I I 

343 CISLPSSVWQEICAGTDII-MCPLCDKRCPYWTLSDSCFYSKLTYLFDNEATVFFACFM 401 

431 ALWAVLLLEYWKRKSATLAYRWDCSDYEDTEERPRPQFAASAPMTAPNPITGEDEPYFPE 490

: I I I : I : I I I : I : I M : I : I I I I : I I I I : I I I II 

402 SLWATMFCEFWKRRQNTIDYDWDLFGFEEQEENIRPEFEAKAPDRRVSPITNLPEQYMKF 461

491 RSRARRMLAGSVVIVVMVAVVVMCLVSIILYRAIMAIVVSRSGNTLLAAWASRIASLTGS 550

I I I I : : I : : : : : I : I I : : : : : : : | | | | : | | 

462 SKRFPRFSTSVPTIAFMILLVMAAVMTVIVYRIVVKTAIFAIDQEFISSYASIITSVTAS 521 

551 VVNLVFILILSKIYVSLAHVLTRWEMHRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFK 610
: : : I : I : I I : I : I II I : I I I : I : : I I : I I I : : : I I I I : I I : I I I I I I 

522 MISLILIMILQILYERIAVWLTNLELHRTETEYEDSFTFKMYLFAFVNYYSTSFYIAFFK 581

611 GRFVGYPGNYHTLFGV-RNEECAAGGCLIELAQELLVIMVGKQVINNMQEVLIPKLKGWW 669 

II I I : I : I I : I I I I II II : : I I I I II I : : I I : I 

582 GRLPGTPADYGRVFGIWRQEECDPAGCTQELFINIAITMCGKQFFNNFMELAMPVLMNFW 641 

670 QKFRLRSKKRKAGASAGASQGPWEDDYELVPC--EGLFDEYLEMVLQFGFVTIFVAACPL 727

: I I : I : I I I I : I I I I I I I I I I : I I I I I I I I I I I I 

642 RS-RTGRKEEKSGK GRYEQWEQDADLADLGPRGLFKEYLEMVVQFGFSTIFVAAFPL 697

728 APLFALLNNWVEIRLDARKFVCEYRRPVAERAQDIGIWFHILAGLTHLAVISNAFLLAFS 787

I I I I I I I I I I I : I I I I II: : I I I I I : I I I I I I I : II : : I : I : : I I : : I I : 
698 APLFALLNNLVEVRLDAYKFISQLRRPVAKRAQDIGAWYAILVTVGNLSVLTNALVIAFT 757 

788 SDFLPR AYYRWTRAHDLRGFLNFTLA RAPSSFAAAHNRT 826 

I : I : I I II I I : I : : I : | | | 

758 SEFIPRQVFKYYYGGPEATLNGYTNWSLSYFNTVDMQNDSKPTDPSYPRVGDEDTTDPNY 817

827 CRYRAFRDDDGHYSQT--YWNLLAIRLAFVIVFEHVVFSVGRLLDLLVPDIPES 878

I I I I I : I I : I I I : : I I : I I I : : : : I I I : : : I I : I I 

818 GLNVSVCRYRGNYDE--HYNVTLDYWLVIAIKLAFILLYEHFVLFTKFFVAYIIPDMPEF 875

879 VEIKVKREYYLAKQAL 894

I : : : I I I II : I I I 
876 VKNQIKRETYLGQQAL 891 






RESULT 12 
TM16A_MOUSE

ID TM16A_MOUSE Reviewed; 960 AA.

AC Q8BHY3; Q6P5C6; Q8BI26; Q99JK1;

DT 29-MAY-2007, integrated into UniProtKB/Swiss-Prot.

DT 29-MAY-2007, sequence version 2.

DT 24-JUL-2007, entry version 32. 

DE Transmembrane protein 16A. 

GN Name=Tmem16a;

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi;

OC Muroidea; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1). 

RC STRAIN=C57BL/6J; TISSUE=Head, and Kidney; 

RX PubMed=16141072; DOI=10.1126/science.1112014;

RA Carninci P., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N.,

RA Oyama R., Ravasi T., Lenhard B., Wells C., Kodzius R., Shimokawa K.,

RA Bajic V.B., Brenner S.E., Batalov S., Forrest A.R., Zavolan M., 

RA Davis M.J., Wilming L.G., Aidinis V., Allen J.E., 

RA Ambesi-Impiombato A., Apweiler R., Aturaliya R.N., Bailey T.L., 

RA Bansal M., Baxter L., Beisel K.W., Bersano T., Bono H., Chalk A.M., 

RA Chiu K.P., Choudhary V., Christoffels A., Clutterbuck D.R., 

RA Crowe M.L., Dalla E., Dalrymple B.P., de Bono B., Della Gatta G.,

RA di Bernardo D., Down T., Engstrom P., Fagiolini M., Faulkner G., 

RA Fletcher C.F., Fukushima T., Furuno M., Futaki S., Gariboldi M., 

RA Georgii-Hemming P., Gingeras T.R., Gojobori T., Green R.E., 

RA Gustincich S., Harbers M., Hayashi Y., Hensch T.K., Hirokawa N., 

RA Hill D., Huminiecki L., Iacono M., Ikeo K., Iwama A., Ishikawa T.,

RA Jakt M., Kanapin A., Katoh M., Kawasawa Y., Kelso J., Kitamura H., 

RA Kitano H., Kollias G., Krishnan S.P., Kruger A., Kummerfeld S.K., 

RA Kurochkin I.V., Lareau L.F., Lazarevic D., Lipovich L., Liu J., 

RA Liuni S., McWilliam S., Madan Babu M., Madera M., Marchionni L., 

RA Matsuda H., Matsuzawa S., Miki H., Mignone F., Miyake S., Morris K., 

RA Mottagui-Tabar S., Mulder N., Nakano N., Nakauchi H., Ng P.,

RA Nilsson R., Nishiguchi S., Nishikawa S., Nori F., Ohara O.,

RA Okazaki Y., Orlando V., Pang K.C., Pavan W.J., Pavesi G., Pesole G., 

RA Petrovsky N., Piazza S., Reed J., Reid J.F., Ring B.Z., Ringwald M., 

RA Rost B., Ruan Y., Salzberg S.L., Sandelin A., Schneider C.,

RA Schoenbach C., Sekiguchi K., Semple C.A., Seno S., Sessa L., Sheng Y.,

RA Shibata Y., Shimada H., Shimada K., Silva D., Sinclair B., 

RA Sperling S., Stupka E., Sugiura K., Sultana R., Takenaka Y., Taki K.,

RA Tammoja K., Tan S.L., Tang S., Taylor M.S., Tegner J., Teichmann S.A.,

RA Ueda H.R., van Nimwegen E., Verardo R., Wei C.L., Yagi K., 

RA Yamanishi H., Zabarovsky E., Zhu S., Zimmer A., Hide W., Bult C.,

RA Grimmond S.M., Teasdale R.D., Liu E.T., Brusic V., Quackenbush J.,

RA Wahlestedt C., Mattick J.S., Hume D.A., Kai C., Sasaki D., Tomaru Y.,

RA Fukuda S., Kanamori-Katayama M., Suzuki M., Aoki J., Arakawa T.,

RA Iida J., Imamura K., Itoh M., Kato T., Kawaji H., Kawagashira N.,

RA Kawashima T., Kojima M., Kondo S., Konno H., Nakano K., Ninomiya N.,

RA Nishio T., Okada M., Plessy C., Shibata K., Shiraki T., Suzuki S.,

RA Tagami M., Waki K., Watahiki A., Okamura-Oho Y., Suzuki H., Kawai J.,

RA Hayashizaki Y.;

RT "The transcriptional landscape of the mammalian genome."; 

RL Science 309:1559-1563(2005). 

RN [2] 

RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2).

RC STRAIN=C57BL/6, and Czech II; TISSUE=Eye, and Mammary tumor;

RX PubMed=15489334; DOI=10.1101/gr.2596504;

RG The MGC Project Team; 

RT "The status, quality, and expansion of the NIH full-length cDNA 

RT project: the Mammalian Gene Collection (MGC)."; 

RL Genome Res. 14:2121-2127(2004).

CC -!- SUBCELLULAR LOCATION: Cell membrane; Multi-pass membrane protein
CC (By similarity). Cytoplasm (By similarity).

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=2; 

CC Name=1;

CC IsoId=Q8BHY3-1; Sequence=Displayed;

CC Name=2; 

CC IsoId=Q8BHY3-2; Sequence=VSP_025672; 

CC -!- SIMILARITY: Belongs to the TMEM16 family. 

CC -!- SEQUENCE CAUTION: 

CC Sequence=BAC35051.1; Type=Miscellaneous discrepancy; Note=Wrong choice of frame;

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribution-NoDerivs License 



CC 

DR EMBL; AK028991; BAC26230.1; ALT_INIT; mRNA. 

DR EMBL; AK029329; BAC26398.1; ALT_INIT; mRNA. 

DR EMBL; AK052589; BAC35051.1; ALT_SEQ; mRNA. 

DR EMBL; BC006062; AAH06062.1; ALT_INIT; mRNA. 

DR EMBL; BC062959; AAH62959.1; -; mRNA. 

DR UniGene; Mm.26700; -.

DR Ensembl; ENSMUSG00000031075; Mus musculus.

DR MGI; MGI:2142149; Tmem16a.

DR ArrayExpress; Q8BHY3; -.

DR InterPro; IPR007632; DUF590. 

DR PANTHER; PTHR12308; DUF590; 1. 

DR Pfam; PF04547; DUF590; 1. 

PE 2: Evidence at transcript level; 

KW Alternative splicing; Cytoplasm; Glycoprotein; Membrane; 

KW Transmembrane. 



FT CHAIN 1 960 Transmembrane protein 16A.
FT /FTId=PRO_0000288436.
FT TOPO_DOM 1 333 Cytoplasmic (Potential).
FT TRANSMEM 334 354 Potential.
FT TOPO_DOM 355 402 Extracellular (Potential).
FT TRANSMEM 403 423 Potential.
FT TOPO_DOM 424 493 Cytoplasmic (Potential).
FT TRANSMEM 494 514 Potential.
FT TOPO_DOM 515 539 Extracellular (Potential).
FT TRANSMEM 540 560 Potential.
FT TOPO_DOM 561 581 Cytoplasmic (Potential).
FT TRANSMEM 582 602 Potential.
FT TOPO_DOM 603 705 Extracellular (Potential).
FT TRANSMEM 706 726 Potential.
FT TOPO_DOM 727 765 Cytoplasmic (Potential).
FT TRANSMEM 766 786 Potential.
FT TOPO_DOM 787 856 Extracellular (Potential).
FT TRANSMEM 857 877 Potential.
FT TOPO_DOM 878 960 Cytoplasmic (Potential).
FT CARBOHYD 806 806 N-linked (GlcNAc...) (Potential).
FT VAR_SEQ 448 451 Missing (in isoform 2).
FT /FTId=VSP_025672.

SQ SEQUENCE 960 AA; 110916 MW; BFD0112FD310CE88 CRC64;



Query Match 30.1%; Score 1489; DB 1; Length 960; 

Best Local Similarity 37.5%; Pred. No. 1e-112;

Matches 361; Conservative 170; Mismatches 303; Indels 128; Gaps 29; 

Qy 26 GLYCRDQAHAERWAMT--SETSSGSHCARSRMLRRRAQEEDSTVLIDVSPPEAEKRGSYG 83

I I I I I : : : : I I I I I I I : I : I : 

Db 52 GLYFRDGKRKVDYILVYHHKRASG SRTLARRGLQNDMVL GTRS 94 

Qy 84 STAHASEPGGQQAAACRAGSPAKPRIADFVLVWEEDLKLDRQQDSAARDRTDMHRTWRET 143 

II:: I I I I I : I : I III 

Db 95 VRQDQPLPG--KGSPVDAGSPEVP MDYHEDD KRFRREE 130

Qy 144 FLDNLRAAGLCVDQQDVQDGNTTVH YALLSASWAVLCYYAEDLRLKLPLQELPNQAS 200

: II III:: I : I : I : : I I I I I I I I : I I : I : : : : : 

Db 131 YEGNLLEAGLELE NDEDTKIHGVGFVKIHAPWHVLCREAEFLKLKMPTKKVYHISE 186 

Qy 201 NWSAGLLAWLGIPNVLLEVVPDVPPEYYSCRFRVNKLPRFLGS DNQDTFFT 251

: I I I I : I I : : : I : I : I : I : : I : I I 

Db 187 --TRGLLK--TINSVLQKITDPIQPKVAEHRPQTTKRLSYPFSREKQHLFDLTDRDSFFD 242

Qy 252 STKRHQILFEILAKTPYGHEKKNLLGIHQLLAEGVLSAAFPLHDGPFKTPPEGPQAPRLN 311 

I I I : : I I I : I I : : I I I I I I I I I I : I I I I I : II I 
Db 243 SKTRSTIVYEILKRTTCTKAKYS-MGITSLLANGVYSAAYPLHDGDY EGDNV-EFN 296 

Qy 312 QRQVLFQHWARWGKWNKYQPLDHVRRYFGEKVALYFAWLGFYTGWLLPAAVVGTLVFLVG 371

!::!:: II : I : I I I I : I I I : I I I I I I I I I I I I I II I : I I : : II : I I I I 
Db 297 DRKLLYEEWASYGVFYKYQPIDLVRKYFGEKVGLYFAWLGAYTQMLIPASIVGVIVFLYG 356

Qy 372 CFLVFSDIPTQELCGSKDSFEMCPLC-LDCPFWLLSSACALAQAGRLFDHGGTVFFSLFM 430 

I I : I I : I : I : : I I I I I I : I : I I I I I I : I III: I I I I I : I I 
Db 357 CATVDENIPSMEMCDQRYNITMCPLCDKTCSYWKMSSACATARASHLFDNPATVFFSVFM 416 

Qy 431 ALWAVLLLEYWKRKSATLAYRWDCSDYEDTEE RPRPQFAA SAPMTAPNPIT 481 

I I I I : I : I I I I I I I I I : : I : M I I : : I I : I I 

Db 417 ALWAATFMEHWKRKQMRLNYRWDLTGFEEEEEAVKDHPRAEYEARVLEKSLRKESRNKET 476

Qy 482 GEDEPYFPERSRARRMLAGSVVIVVMVAVVVMCLVSIILYRAIMAIVVSRSGNTLLAAWA 541

I : II I I : I : I I : : : I : I I I : : : : : : 

Db 477 --DKVKLTWRDRFPAYFTNLVSIIFMIAVTFAIVLGVIIYRISTAAALAMNSSPSVRSNI 534

Qy 542 SRIASLTGSVVNLVFILILSKIYVSLAHVLTRWEMHRTQTKFEDAFTLKVFIFQFVNFYS 601 

: I : : I II I : : I : : I : I | | : | : : | : | | : | | | : : | | I I : 
Db 535 RVTVTATAVIINLVVIILLDEVYGCIARWLTKIEVPKTEKSFEERLTFKAFLLKFVNSYT 594 

Qy 602 SPVYIAFFKGRFVGYPGNYHTLF-GVRNEECAAGGCLIELAQELLVIMVGKQVI-NNMQE 659 
1:111111111 I I : I : I I I I I I I I I I : I I : I : I I : I I I : I I I : I

595 PIFYVAFFKGRFVGRPGDYVYIFRSFRMEECAPGGCLMELCIQLSIIMLGKQLIQNNLFE 654

660 VLIPKLKGWWQKFRLRSKKRKAGASAGASQGPWEDDYELVPCEGLFDEYLEMVLQFGFVT 719

: I I I : I : : : I I : : : I I : I I II I I : I I :: I I I I I I 

655 IGIPKMKKFIRYLKLRRQSPSDREEYVKRKQRYEVDFNLEPFAGLTPEYMEMIIQFGFVT 714 

720 IFVAACPLAPLFALLNNWVEIRLDARKFVCEYRRPVAERAQDIGIWFHILAGLTHLAVIS 779

: I I I : I I I I I I I I I I I : I I I I I I : I I I I I I I I I I I : I I I I I :: I I I : I I I I 
715 LFVASFPLAPLFALLNNIIEIRLDAKKFVTELRRPVAIRAKDIGIWYNILRGVGKLAVII 774

780 NAFLLAFSSDFLPRAYYRWTRAHD--LRGFLNFTLARAPSSF AAAHN 824

I I I : : : I : I I I : I I I : : : : I I : I I I III II 

775 NAFVISFTSDFIPRLVYLYMYSQNGTMHGFVNHTL SSFNVSDFQNGTAPNDPLDLG 830 

825 RTCRYRAFRD DDGHY--SQTYWNLLAIRLAFVIVFEHVVFSVGRLLDLLVPDIP 876

: I I I : : I : : I | : : | : | I I I II II I I : : : I : : | : : || || 

831 YEVQICRYKDYREPPWSEHKYDISKDFWAVLAARLAFVIVFQNLVMFMSDFVDWVIPDIP 890

877 ESVEIKVKREYYL AKQALAENEVLFGTNGTKDEQPKGSELSSHWTPFTVP 926 

: : : : : I I | | | : : : I : I : : : | : | | | 

891 KDISQQIHKEKVLMVELFMREEQGKQQLLDTWM EKEKPRDVPCNNH-SPTTHP 942 

927 KA 928 
: I 

943 EA 944 



RESULT 13 
TM16A_HUMAN 

ID TM16A_HUMAN Reviewed; 986 AA. 

AC Q5XXA6; Q8IYY8; Q8N7V3;

DT 29-MAY-2007, integrated into UniProtKB/Swiss-Prot.

DT 23-NOV-2004, sequence version 1. 

DT 21-AUG-2007, entry version 18. 

DE Transmembrane protein 16A (Discovered on gastrointestinal stromal 

DE tumors protein 1) (Oral cancer overexpressed protein 2) (Tumor 

DE amplified and overexpressed sequence 2). 

GN Name=TMEM16A; Synonyms=DOG1, ORAOV2, TAOS2;

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;

OC Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 1), AND TISSUE SPECIFICITY. 

RX PubMed=16906560; DOI=10.1002/gcc.20371;

RA Huang X., Godfrey T.E., Gooding W.E., McCarty K.S. Jr., Gollin S.M.;

RT "Comprehensive genome and transcriptome analysis of the 11q13 amplicon

RT in human oral cancer and synteny to the 7F5 amplicon in murine oral 

RT carcinoma."; 

RL Genes Chromosomes Cancer 45:1058-1069(2006). 

RN [2] 

RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 3). 

RC TISSUE=Testis; 

RX PubMed=14702039; DOI=10.1038/ng1285;

RA Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R.,

RA Wakamatsu A., Hayashi K., Sato H., Nagai K., Kimura K., Makita H.,

RA Sekine M., Obayashi M., Nishi T., Shibahara T., Tanaka T., Ishii S.,

RA Yamamoto J., Saito K., Kawai Y., Isono Y., Nakamura Y., Nagahari K., 

RA Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M., Shiratori A., 

RA Sudo H., Hosoiri T., Kaku Y., Kodaira H., Kondo H., Sugawara M.,

RA Takahashi M., Kanda K., Yokoi T., Furuya T., Kikkawa E., Omura Y., 

RA Abe K., Kamihara K., Katsuta N., Sato K., Tanikawa M., Yamazaki M., 

RA Ninomiya K., Ishibashi T., Yamashita H., Murakawa K., Fujimori K., 

RA Tanai H., Kimata M., Watanabe M., Hiraoka S., Chiba Y., Ishida S., 

RA Ono Y., Takiguchi S., Watanabe S., Yosida M., Hotuta T., Kusano J., 

RA Kanehori K., Takahashi-Fujii A., Hara H., Tanase T.-O., Nomura Y.,

RA Togiya S., Komai F., Hara R., Takeuchi K., Arita M., Imose N.,

RA Musashino K., Yuuki H., Oshima A., Sasaki N., Aotsuka S., 

RA Yoshikawa Y., Matsunawa H., Ichihara T., Shiohata N., Sano S., 

RA Moriya S., Momiyama H., Satoh N., Takami S., Terashima Y., Suzuki O.,

RA Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F., Wakebe H., 

RA Hishigaki H., Watanabe T., Sugiyama A., Takemoto M., Kawakami B., 

RA Yamazaki M., Watanabe K., Kumagai A., Itakura S., Fukuzumi Y., 

RA Fujimori Y., Komiyama M., Tashiro H., Tanigami A., Fujiwara T., 

RA Ono T., Yamada K., Fujii Y., Ozaki K., Hirao M., Ohmori Y., 

RA Kawabata A., Hikiji T., Kobatake N., Inagaki H., Ikema Y., Okamoto S., 

RA Okitani R., Kawakami T., Noguchi S., Itoh T., Shigeta K., Senba T., 

RA Matsumura K., Nakajima Y., Mizuno T., Morinaga M., Sasaki M., 

RA Togashi T., Oyama M., Hata H., Watanabe M., Komatsu T., 

RA Mizushima-Sugano J., Satoh T., Shirai Y., Takahashi Y., Nakagawa K., 

RA Okumura K., Nagase T., Nomura N., Kikuchi H., Masuho Y., Yamashita R., 

RA Nakai K., Yada T., Nakamura Y., Ohara O., Isogai T., Sugano S.;

RT "Complete sequencing and characterization of 21,243 full-length human

RT cDNAs.";

RL Nat. Genet. 36:40-45(2004). 

RN [3] 

RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2). 

RC TISSUE=Testis; 

RX PubMed=15489334; DOI=10.1101/gr.2596504;

RG The MGC Project Team; 

RT "The status, quality, and expansion of the NIH full-length cDNA 

RT project: the Mammalian Gene Collection (MGC)."; 

RL Genome Res. 14:2121-2127(2004). 

RN [4] 

RP IDENTIFICATION, AND TOPOLOGY. 

RX PubMed=12739008; 

RA Katoh M., Katoh M.;

RT "FLJ10261 gene, located within the CCND1-EMS1 locus on human

RT chromosome 11q13, encodes the eight-transmembrane protein homologous

RT to C12orf3, C11orf25 and FLJ34272 gene products.";

RL Int. J. Oncol. 22:1375-1381(2003). 

RN [5] 

RP SUBCELLULAR LOCATION, AND TISSUE SPECIFICITY. 

RX PubMed=15215166; 

RA West R.B., Corless C.L., Chen X., Rubin B.P., Subramanian S., 

RA Montgomery K., Zhu S., Ball C.A., Nielsen T.O., Patel R.,

RA Goldblum J.R., Brown P.O., Heinrich M.C., van de Rijn M.;

RT "The novel marker, DOG1, is expressed ubiquitously in gastrointestinal

RT stromal tumors irrespective of KIT or PDGFRA mutation status.";

RL Am. J. Pathol. 165:107-113(2004).

RN [6] 

RP SUBCELLULAR LOCATION. 

RX PubMed=16261155; DOI=10.1038/sj.onc.1209203;

RA Carles A., Millon R., Cromer A., Ganguli G., Lemaire F., Young J.,

RA Wasylyk C., Muller D., Schultz I., Rabouel Y., Dembele D., Zhao C.,

RA Marchal P., Ducray C., Bracco L., Abecassis J., Poch O., Wasylyk B.;

RT "Head and neck squamous cell carcinoma transcriptome analysis by 

RT comprehensive validated differential display."; 

RL Oncogene 25:1821-1831(2006).

CC -!- SUBCELLULAR LOCATION: Cell membrane; Multi-pass membrane protein.

CC Cytoplasm. 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=3; 

CC Name=1;

CC IsoId=Q5XXA6-1; Sequence=Displayed;

CC Name=2; 

CC IsoId=Q5XXA6-2; Sequence=VSP_025665, VSP_025668, VSP_025669; 

CC Note=No experimental confirmation available; 

CC Name=3; 

CC IsoId=Q5XXA6-3; Sequence=VSP_025666, VSP_025667, VSP_025668, 

CC VSP_025669, VSP_025670, VSP_025671; 

CC Note=No experimental confirmation available; 

CC -!- TISSUE SPECIFICITY: Broadly expressed with higher levels in liver 
CC and skeletal muscle. 

CC -!- SIMILARITY: Belongs to the TMEM16 family. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribution-NoDerivs License 

CC 

DR EMBL; AY728143; AAU82085.1; -; mRNA. 

DR EMBL; AK097619; BAC05123.1; -; mRNA. 

DR EMBL; BC033036; AAH33036.2; -; mRNA. 

DR UniGene; Hs.503074; -.

DR Ensembl; ENSG00000131620; Homo sapiens. 

DR HGNC; HGNC:21625; TMEM16A. 

DR MIM; 610108; gene. 

DR PharmGKB; PA128394678; -. 

DR ArrayExpress; Q5XXA6; -.

DR InterPro; IPR007632; DUF590. 

DR PANTHER; PTHR12308; DUF590; 2. 

DR Pfam; PF04547; DUF590; 1. 

PE 1: Evidence at protein level; 

KW Alternative splicing; Cytoplasm; Glycoprotein; Membrane; Polymorphism; 

KW Transmembrane. 



FT CHAIN 1 986 Transmembrane protein 16A.
FT /FTId=PRO_0000288435.
FT TOPO_DOM 1 333 Cytoplasmic (Potential).
FT TRANSMEM 334 354 Potential.
FT TOPO_DOM 355 402 Extracellular (Potential).
FT TRANSMEM 403 423 Potential.
FT TOPO_DOM 424 519 Cytoplasmic (Potential).
FT TRANSMEM 520 540 Potential.
FT TOPO_DOM 541 565 Extracellular (Potential).
FT TRANSMEM 566 586 Potential.
FT TOPO_DOM 587 606 Cytoplasmic (Potential).
FT TRANSMEM 607 627 Potential.
FT TOPO_DOM 628 731 Extracellular (Potential).
FT TRANSMEM 732 752 Potential.
FT TOPO_DOM 753 791 Cytoplasmic (Potential).
FT TRANSMEM 792 812 Potential.
FT TOPO_DOM 813 882 Extracellular (Potential).
FT TRANSMEM 883 903 Potential.
FT TOPO_DOM 904 986 Cytoplasmic (Potential).
FT CARBOHYD 832 832 N-linked (GlcNAc...) (Potential).
FT VAR_SEQ 1 116 Missing (in isoform 2).
FT /FTId=VSP_025665.
FT VAR_SEQ 1 28 Missing (in isoform 3).
FT /FTId=VSP_025666.
FT VAR_SEQ 29 36 GYLPSEGT -> MLTRPSQV (in isoform 3).
FT /FTId=VSP_025667.
FT VAR_SEQ 448 451 Missing (in isoform 2 and isoform 3).
FT /FTId=VSP_025668.
FT VAR_SEQ 476 501 Missing (in isoform 2 and isoform 3).
FT /FTId=VSP_025669.
FT VAR_SEQ 651 700 CAPGGCLMELCIQLSIIMLGKQLIQNNLFEIGIPKMKKLIR
FT YLKLKQQSP -> VTEILFISGSPFCLAYDLSTPCTWEKQL
FT QHICSAKSSRFLSFLLETFLFP (in isoform 3).
FT /FTId=VSP_025670.
FT VAR_SEQ 701 986 Missing (in isoform 3).
FT /FTId=VSP_025671.
FT VARIANT 608 608 F -> S (in dbSNP:rs2186797).
FT /FTId=VAR_032417.
FT VARIANT 983 983 G -> R (in dbSNP:rs3740722).
FT /FTId=VAR_032418.
FT CONFLICT 801 801 N -> D (in Ref. 3; AAH33036).
FT CONFLICT 948 948 W -> C (in Ref. 3; AAH33036).

SQ SEQUENCE 986 AA; 114078 MW; E30A02F91EF36FC2 CRC64;



Query Match 29.9%; Score 1482; DB 1; Length 986;

Best Local Similarity 36.8%; Pred. No. 4e-112;

Matches 365; Conservative 162; Mismatches 302; Indels 164;

Qy 26 GLYCRDQAHAERWAMT--SETSSGSHCARSRMLRRRAQEEDSTVLIDVSPPEAEKRGSYG 83

I I I I I : : : I I : I I I I I I : I

Db 52 GLYFRDGRRKVDYILVYHHKRPSG NRTLVRRVQHSDTP SGA 92

Qy 84 STAHASEPGGQQAAACRAGSPAKPRIADFVLVWEEDLKLDRQQDSAARDRTDMHRTWRET 143

: I : I : I I I I : I : I III

Db 93 RSVKQDHPLPGKGASLDAGSGEPP MDYHEDD KRFRREE 130

Qy 144 FLDNLRAAGLCVDQQDVQDGNTTVH YALLSASWAVLCYYAEDLRLKLPLQELPNQAS 200

: M III:: : I : I : I : : I I I I I I I I : I I : I : : : :

Db 131 YEGNLLEAGLELE RDEDTKIHGVGFVKIHAPWNVLCREAEFLKLKMPTKKMYH--I 184

Qy 201 NWSAGLLAWLGIPNVLLEVVPDVPPEYYSCR FRVNKLPRFLGSDNQDTFF 250

I : I I I I : I I : : : I : I I I I I I : I : I I

Db 185 NETRGLLK--KINSVLQKITDPIQPKVAEHRPQTMKRLSYPFSREKQHLFDLSD-KDSFF 241

Qy 251 TSTKRHQILFEILAKTPYGHEKKNLLGIHQLLAEGVLSAAFPLHDGPFKTPPEGPQAPRL 310

I I I : : I I I : I I : : I I I I I I I : I I : I I I I I : :

Db 242 DSKTRSTIVYEILKRTTCTKAKYS-MGITSLLANGVYAAAYPLHDGDY NGENVEF 295



311 NQRQVLFQHWARWGKWNKYQPLDHVRRYFGEKVALYFAWLGFYTGWLLPAAVVGTLVFLV 370

I I : : I : : I I I : I : I I I I : I I I : I I I ! I : I I I I I I I II I : I I : : I I : I I I 

296 NDRKLLYEEWARYGVFYKYQPIDLVRKYFGEKIGLYFAWLGVYTQMLIPASIVGIIVFLY 355

371 GCFLVFSDIPTQELCGSKDSFEMCPLC-LDCPFWLLSSACALAQAGRLFDHGGTVFFSLF 429

II : : I I : I : I : : I I I I I I : I : I I I I I I : I III: I I I I I : I 

356 GCATMDENIPSMEMCDQRHNITMCPLCDKTCSYWKMSSACATARASHLFDNPATVFFSVF 415 

430 MALWAVLLLEYWKRKSATLAYRWDCSDYEDTEE RPRPQFAA SAPMTAPNPI 480 

I I I I I : I : I I I I I I I I I : : I : I I I I : : I I : I 

416 MALWAATFMEHWKRKQMRLNYRWDLTGFEEEEEAVKDHPRAEYEARVLEKSLKKESRNK- 474

481 TGEDEPYFPERS RARRMLAG SVVIVVMVAVVVM 513 

I : I I I I : : I I I I : I : I I 

475 --EKRRHIPEESTNKWKQRVKTAMAGVKLTDKVKLTWRDRFPAYLTNLVSIIFMIAVTFA 532

514 CLVSIILYRAIMAIVVSRSGNTLLAAWASRIASLTGSVVNLVFILILSKIYVSLAHVLTR 573
: : : I : I I II : : : : : : : I : : I I I I : : I : : I : I I I : 

533 IVLGVIIYRISMAAALAMNSSPSVRSNIRVTVTATAVIINLVVIILLDEVYGCIARWLTK 592

574 WEMHRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNYHTLF-GVRNEECA 632

I : : I : I I : I I : : I I I I : I : I I I I I I I I I I I : I : I I I I I I 

593 IEVPKTEKSFEERLIFKAFLLKFVNSYTPIFYVAFFKGRFVGRPGDYVYIFRSFRMEECA 652

633 AGGCLIELAQELLVIMVGKQVI-NNMQEVLIPKLKGWWQKFRLRSKKRKAGASAGASQGP 691

1111:11 : I : I I : I I I : I I I : I : I I I : I : : I : : : 
653 PGGCLMELCIQLSIIMLGKQLIQNNLFEIGIPKMKKLIRYLKLKQQSPPDHEECVKRKQR 712 

692 WEDDYELVPCEGLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNNWVEIRLDARKFVCEY 751 

: I II I I II I I : I I : : I I I I I I : I I I : I I I I I I I I I I I : I I I I I I : I I I I 
713 YEVDYNLEPFAGLTPEYMEMIIQFGFVTLFVASFPLAPLFALLNNIIEIRLDAKKFVTEL 772

752 RRPVAERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRA--YYRWTRAHDLRGFLN 809

I I I I I I I : I I I I I :: I I I : I I I I M i : : : I : I I I : I I I : : : : I I : I 

773 RRPVAVRAKDIGIWYNILRGIGKLAVIINAFVISFTSDFIPRLVYLYMYSKNGTMHGFVN 832

810 FTLARAPSSF AAAHN RTCRYRAFRD DDGHY--SQTYWNLLA 848

II III II : I I I : : I : : | | : : | : | | 

833 HTL SSFNVSDFQNGTAPNDPLDLGYEVQICRYKDYREPPWSENKYDISKDFWAVLA 888 

849 IRLAFVIVFEHVVFSVGRLLDLLVPDIPESVEIKVKREYYLA KQALAENE 898

I I I I I I I I ::: I : : | : : | | | | : : : : : | | | | | | 

889 ARLAFVIVFQNLVMFMSDFVDWVIPDIPKDISQQIHKEKVLMVELFMREEQDKQQLL--E 946

899 VLFGTNGTKDEQP KGSELSSH 919 

I I I I II II 

947 TWMEKERQKDEPPCNHHNTKACPDSLGSPAPSH 979



RESULT 14 

Q8CFW1_MOUSE

ID Q8CFW1_MOUSE Unreviewed; 913 AA.

AC Q8CFW1;

DT 01-MAR-2003, integrated into UniProtKB/TrEMBL.

DT 01-MAR-2003, sequence version 1.

DT 24-JUL-2007, entry version 21. 

DE Transmembrane protein 16B. 

GN Name=Tmem16b;

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; 

OC Muroidea; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP NUCLEOTIDE SEQUENCE. 

RC TISSUE=Eye; 

RX MEDLINE=22388257; PubMed=12477932; DOI=10.1073/pnas.242603899;

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F., 

RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,

RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A., 

RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A., 

RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M., 

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.; 

RT "Generation and initial analysis of more than 15,000 full-length human 

RT and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN [2] 

RP NUCLEOTIDE SEQUENCE. 

RC TISSUE=Eye; 

RG NIH MGC Project; 

RL Submitted (JUN-2002) to the EMBL/GenBank/DDBJ databases. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribution-NoDerivs License 

CC 

DR EMBL; BC033409; AAH33409.1; -; mRNA.

DR UniGene; Mm.333398; -.

DR Ensembl; ENSMUSG00000038115; Mus musculus.

DR MGI; MGI:2387214; Tmem16b.

DR GO; GO:0016021; C:integral to membrane; IEA:UniProtKB-KW.

DR InterPro; IPR007632; DUF590. 

DR PANTHER; PTHR12308; DUF590; 1. 

DR Pfam; PF04547; DUF590; 1. 

PE 2: Evidence at transcript level; 

KW Transmembrane. 

SQ SEQUENCE 913 AA; 104388 MW; CA17DB27D8167F64 CRC64; 

Query Match 29.6%; Score 1467.5; DB 2; Length 913; 
Best Local Similarity 38.1%; Pred. No. 5.6e-111;

Matches 331; Conservative 167; Mismatches 274; Indels 97; Gaps 22; 

Qy 78 KRGS YGSTAH--ASEPGGQQAAACRAGSPAKPRIADFVLVWEEDLKLDRQQDSAARD 132

III: : I I I I I : II : : I I 

Db 20 KRGAHLGHGSPGHSLAVISNGETGKERHGGGPGDVELGPLDALEEE 65

Qy 133 RTDMHRTWRETFLDNLRAAGLCVDQQDVQDGNTTVHYALLSASWAVLCYYAEDLRLKLPL 192

I I : I I I I I I I : : : I : : : : : I I I I I I I : : I : I 

Db 66 RREQRDEFEHNLMAAGLEL-EKDLESKSQGSVFVRIHAPWQVLAREAEFLKIKVPT 120

Qy 193 QEL PNQASNWSAGLLAWLGIPNVLLEWPDVPPEYYSCRFRVNKLP RF 240

: : : : | : | | : | | | : | | | | : : | : | 

Db 121 KKMYEIKAGGSIAKKFSA-ILQTLSSP LQPRV-PEHSNNRMKNLSYPFSREKMYL 173

Qy 241 LGSDNQDTFFTSTKRHQILFEILAKTPYGHEKKNLLGIHQLLAEGVLSAAFPLHDGPFKT 300

: I I I I : I : I : I I I : I | : || : | : | : | | : | | | | | : : 

Db 174 YNIQEKDTFFDNATRSRIVHEILKRTACS-RANNTMGINSLIANNIYEAAYPLHDGEYDS 232

301 PPEGPQAPRLNQRQVLFQHWARWGKWNKYQPLDHVRRYFGEKVALYFAWLGFYTGWLLPA 360

I : : I I : : I : I I I I : I : I : I I : I : I : I I I I I : I I I I I I I II : I : I : 
233 PGDD MNDRKLLYQEWARYGVFYKFQPIDLIRKYFGEKIGLYFAWLGLYTSFLIPS 287 

361 AVVGTLVFLVGCFLVFSDIPTQELCGSKDSFEMCPLC-LDCPFWLLSSACALAQAGRLFD 419 

: I : I : I I I I I : I I I : : I : I : : : 1 I I I I I I : I I I I I I I : I III 

288 SVIGVIVFLYGCATIEEDIPSKEMCDHQNAFTMCPLCDKSCDYWNLSSACGTARASHLFD 347

420 HGGTVFFSLFMALWAVLLLEYWKRKSATLAYRWDCSDYEDTEER PRPQF 468 

: I I I I I : I I I I I I : I I I I I Mil: I : I I I I I : : 

348 NPATVFFSIFMALWATMFLENWKRLQMRLGYFWDLTGIEEEEERSQEHSRPEYETKVREK 407

469 AASAPMTAPNPITGE-DEPYFPERSRARRMLAGSVVIVVMVAVVVMCLVSIIL 520 

: I : I : I III : I I I : I : I : : : I : 

408 LLKESGKSAVQKLEANSPEDDEDDEDKLTWKDRFPGYLMNFASILFMIALTFSIVFGVIV 467

521 YRAIMAIVVSRSGNTLLAAWASRI ASLTGSVVNLVFILILSKIYVSLAHVLTRWEMH 577

II I : I I I I : : I : : I I I I I I I : I I : : I II : I : 
468 YRITTAAALS LNKATRSNVRVTVTATAVIINLVVILILDEIYGAVAKWLTKIEVP 522

578 RTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNYHTLF-GVRNEECAAGGC 636 

: I : II: I I I :: I I I I I I : I I I I I I I I I I I : I : I I I I I I I I I I 

523 KTEQTFEERLILKAFLLKFVNAYSPIFYVAFFKGRFVGRPGSYVYVFDGYRMEECAPGGC 582 

637 LIELAQELLVIMVGKQVI-NNMQEVLIPKLKGWWQKFRLRSKKRKAGASAGASQGPWEDD 695

I : I I : I : I I : I I I : I I I : I : : I I I I : : I : : : : : I : I 

583 LMELCIQLSIIMLGKQLIQNNIFEIGVPKLKKLFRKLKDETEPGESDPDHSKRPEQWDLD 642

696 YELVPCEGLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNNWVEIRLDARKFVCEYRRPV 755 

: I I II I I : I I : : I I I I I I : I I I : I I I I : I I I I I I : I : I I I I : I I I I III 

643 HSLEPYTGLTPEYMEMIIQFGFVTLFVASFPLAPVFALLNNVIEVRLDAKKFVTELRRPD 702 

756 AERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRAYYRWTRAHD--LRGFLNFTLA 813
I I : I I I I I I I I : I : : I I I I I : : I : I I I : I I I : : : : I : I I I : I II : 

703 AVRTKDIGIWFDILSGIGKFSVIINAFVIAVTSDFIPRLVYQYSYSHNGTLHGFVNHTLS 762

814 RAPSSFAAAHNRTCRYRAFRD DDGHYSQTYWNLLAIRLAFVIVF 857 
: : I : I I : : : I : : : | : | | : : | : | | | | | | : |

Db 763 FFNVSQLKEGTQPENSQFDQEVQFCRFKDYREPPWAPNPYEFSKQYWSVLSARLAFVIIF 822 

Qy 858 EHVVFSVGRLLDLLVPDIPESVEIKVKRE 886 

: : : I : I : I : : I I I I : : : I : I 
Db 823 QNLVMFLSVLVDWMIPDIPTDISDQIKKE 851 



RESULT 15 
TM16B_HUMAN 

ID TM16B_HUMAN Reviewed; 999 AA. 

AC Q9NQ90; 

DT 16-JAN-2004, integrated into UniProtKB/Swiss-Prot.

DT 01-OCT-2000, sequence version 1.

DT 24-JUL-2007, entry version 35. 

DE Transmembrane protein 16B. 

GN Name=TMEM16B; Synonyms=C12orf3;

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; 

OC Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP NUCLEOTIDE SEQUENCE [MRNA].

RC TISSUE=Retina; 

RA Lorenz B., White K.E., Econs M.J., Strom T.M.; 

RT "Transcripts in 12p13.3.";

RL Submitted (FEB-2000) to the EMBL/GenBank/DDBJ databases. 

CC -!- SUBCELLULAR LOCATION: Membrane; Multi-pass membrane protein 

CC (Probable).

CC -!- SIMILARITY: Belongs to the TMEM16 family. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribution-NoDerivs License 

CC 

DR EMBL; AJ272204; CAC01125.1; -; mRNA. 

DR UniGene; Hs.148970; -.

DR Ensembl; ENSG00000047617; Homo sapiens. 

DR KEGG; hsa:57101; -. 

DR HGNC; HGNC:1183; TMEM16B. 

DR MIM; 610109; gene. 

DR PharmGKB; PA25504; -. 

DR LinkHub; Q9NQ90; -. 

DR ArrayExpress; Q9NQ90; -. 

DR GermOnline; ENSG00000047617; Homo sapiens. 

DR InterPro; IPR007632; DUF590. 

DR PANTHER; PTHR12308; DUF590; 1. 

DR Pfam; PF04547; DUF590; 1. 

PE 2: Evidence at transcript level; 

KW Membrane; Polymorphism; Transmembrane. 

FT CHAIN 1 999 Transmembrane protein 16B. 

FT /FTId=PRO_0000072564. 

FT TRANSMEM 360 382 Potential.

FT TRANSMEM 535 557 Potential. 

FT TRANSMEM 577 599 Potential. 
FT TRANSMEM 619 641 Potential.

FT TRANSMEM 746 768 Potential.

FT TRANSMEM 796 818 Potential.

FT TRANSMEM 898 920 Potential.

FT VARIANT 108 108 V -> A (in dbSNP:rs3741903).

FT /FTId=VAR_021932.

FT VARIANT 501 501 S -> A (in dbSNP:rs1860961).

FT /FTId=VAR_020331.

SQ SEQUENCE 999 AA; 113616 MW; B9B4F56161AE1B00 CRC64;


Query Match 29.6%; Score 1464; DB 1; Length 999;

Best Local Similarity 37.4%; Pred. No. 1.2e-110;



Matches 344; Conservative 167; Mismatches 284; Indels 124; Gaps 27; 

Qy 80 GSYGSTAHASEPGGQQAAACRAGSPAKPRIADFVLVWEEDLKLDRQQDSAARDRTDMHRT 139

I I I I I I I : : I I : : I : 
Db 125 GETGKEPHAGGPG DIELG-PLDALEEERKEQ 154

Qy 140 WRETFLDNLRAAGLCVDQQDVQDGNTTVHYALLSASWAVLCYYAEDLRLKLPLQELPNQ- 198

III II I I I : : : I : : : : : : I I I I | | | : : | : | : : : 
Db 155 -REEFEHNLMEAGLEL-EKDLENKSQGSIFVRIHAPWQVLAREAEFLKIKVPTKKEMYEI 212

199 ASNWSAGLLAWLGIPNVLLEWPDVPPEYYSCRFRVNKLP RFLGSDNQ 246

I : I I I : : I I I I : : : : | : 

213 KAGGSIAKKFSAAL QKLSSHLQPRV-PEHSNNKMKNLSYPFSREKMYLYNIQEK 265 

247 DTFFTSTKRHQILFEILAKTPYGHEKKNLLGIHQLLAEGVLSAAFPLHDGPFKTPPEGPQ 306

INI: I : I : I I I : I I : I I : I : I : | | : | | | | | : : | : 

266 DTFFDNATRSRIVHEILKRTACS-RANNTMGINSLIANNIYEAAYPLHDGEYDSPEDD-- 322

307 APRLNQRQVLFQHWARWGKWNKYQPLDHVRRYFGEKVALYFAWLGFYTGWLLPAAVVGTL 366

: I I : : I : I I I I : I : I : I I : I : I : I I I I I : I I I I I I I II : I : I : : I : I : 
323 MNDRKLLYQEWARYGVFYKFQPIDLIRKYFGEKIGLYFAWLGLYTSFLIPSSVIGVI 379

367 VFLVGCFLVFSDIPTQELCGSKDSFEMCPLC-LDCPFWLLSSACALAQAGRLFDHGGTVF 425

Mill : I I I : : I : I : : : I I I I I I I : I I I I I I III III: III 
380 VFLYGCATIEEDIPSREMCDQQNAFTMCPLCDKSCDYWNLSSACGTAQASHLFDNPATVF 439 

426 FSLFMALWAVLLLEYWKRKSATLAYRWDCSDYEDTEER PRPQFAA 470 

I I : I I I I I I : I I I I I MM: I : I || I I : : 

440 FSIFMALWATMFLENWKRLQMRLGYFWDLTGIEEEEERAQEHSRPEYETKVREKMLKESN 499 

471 -SAPMTAPNPIT GEDEPYFPERSRARRMLAGSVVIVVMVAVVVMCLVSIILYRAIM 525

II I : I I : I I I : I : I : : : | : | | 

500 QSAVQKLETNTTECGDEDDEDKLTWKDRFPGYLMNFASILFMIALTFSIVFGVIVYRITT 559

526 AIVVSRSGNTLLAAWASRI ASLTGSVVNLVFILILSKIYVSLAHVLTRWEMHRTQTK 582

I : I I I I : : I : : I II II II : II : : I | | : | : : | : 

560 AAALS LNKATRSNVRVTVTATAVIINLVVILILDEIYGAVAKWLTKIEVPKTEQT 614

583 FEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNYHTLF-GVRNEECAAGGCLIELA 641 
II: II I :: II I I I I : II II I II II II : I : I I I II II II I I : II 

615 FEERLILKAFLLKFVNAYSPIFYVAFFKGRFVGRPGSYVYVFDGYRMEECAPGGCLMELC 674

642 QELLVIMVGKQVI-NNMQEVLIPKLKGWWQKFRLRSKKRKAGASAGA-SQGP--WEDDYE 697
: I : I I : I I I : I II : I : : II II : II : : II : I I : I MM 
675 IQLSIIMLGKQLIQNNIFEIGVPKLK KLFRKLKDETEAGETDSAHSKHPEQWDLDYS 731

698 LVPCEGLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNNWVEIRLDARKFVCEYRRPVAE 757

I I II I I : I I : : I I I I I I : I I I : I I I I : I I I I I I : I : I I II : II I I III I 
732 LEPYTGLTPEYMEMIIQFGFVTLFVASFPLAPVFALLNNVIEVRLDAKKFVTELRRPDAV 791

758 RAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRAYYRWTRAHD--LRGFLNFTLA-- 813
I : I I I I I I I I : I : : I I I I I I : : I : I I I : I I I : : : : I : I I I : I I I : 

792 RTKDIGIWFDILSGIGKFSVISNAFVIAITSDFIPRLVYQYSYSHNGTLHGFVNHTLSFF 851 

814 RAPSSFAAAHNRTCRYRAFRD DDGHYSQTYWNLLAIRLAFVIVFEH 859 

: : I : II : : : I : : : | : || : | : || || || : | : : 

852 NVSQLKEGTQPENSQFDQEVQFCRFKDYREPPWAPNPYEFSKQYWFILSARLAFVIIFQN 911 

860 VVFSVGRLLDLLVPDIPESVEIKVKRE YYLAKQALAENEVLFGTNGTKDEQPKG 913

: I : I : I : : I I I I : : : I : I : : I : I : I I : II 
912 LVMFLSVLVDWMIPDIPTDISDQIKKEKSLLVDFFLKE EHEKLKLMDEPALRSPGG 967 

914 SELSSHWTPFTVPKA-SQL 931 

: I : I III 

968 GDRSRSRAASSAPSGQSQL 986 



Search completed: June 24, 2008, 08:41:08 
Job time : 524 secs